Gene Emin_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1033 
Symbol 
ID6263649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1124062 
End bp1127370 
Gene Length3309 bp 
Protein Length1102 aa 
Translation table11 
GC content41% 
IMG OID642611513 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_001875923 
Protein GI187251441 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATTC TTAATTTTTT AAAACCTAAA AACGGTAAAA AACAAAAAGT TTTATTGCTC 
GGTTCGGGTG CGTTATCAAT AGGCCAGGCG GGCGAGTTTG ATTATTCCGG CTCGCAGGCT
ATTAAAGCTT TGGAGGAAGA AGGCTTAGAA GTAATTGTTC TTAACCCTAA CATAGCCTCC
GTGCAGACAA ACCCCGCTCC CAACAAAAAG ATTTACCTTT ACCCCGTAAC ACCTTTTTGG
ATTGAAAAAA TTATTAAAAA AGAAAGACCC GTGGCTTTAA TAGCGGGATT CGGCGGCCAG
ACCTCTTTAA ACTGCGCTAT CGAACTTCAT AATAACGGCG TTTTAAAAAA ATACGGCGTT
AAAGTTTTAG GCACGCCAGT AAGCTCTTTA GAAATGTCCG AAGACAGAGA TTTATTTTCC
AAAAGAATGC ATGAAATAGG CGTTCCCACC CCTCCCAGTA AAGCGGTTGA AACCGTGGAA
GAAGCTTTAA AAACAGCGCT TGAAATAGGC TACCCCGTTA TAACCCGCTC GGCCTACGCT
TTAGGCGGTT TAGGCAGCGG TTTAGCTGAA AACCCCGAAC AACTTGAAAA GCTGGCCTCC
TCTGCGCTTA CTTCCAGCCC GCAGATTTTA ATTGAAAAAT CCCTCCACGG CTGGAAAGAA
ATTGAATATG AGGTAATGCG CGACGCGTGC GGGAACTCCA TAACAATTTG TAATATGGAA
AACTTTGACC CCATGGGCAT ACACACGGGC GACTCCATTG TTATAGCGCC GTGCCAAACC
TTAAATAACA GGGAAAATAA TATGCTCCGC GACGCGGCTT TAAATATTGT TAAAAGCATA
GGCGTTGTGG GTGAATGTAA CGTCCAATTC GCTTTAAGCC CTTTTACGCT TGAATATTAC
GTAATTGAAA TTAACGCAAG GCTTTCACGC TCAAGCGCTT TGGCAAGCAA AGCCACGGGT
TACCCGATAG CGTTTGTGGC GGCCAAGGTT GTAAGCGGTT TTGATTTACT TGAACTTAAA
AACCCCGTTA CGGGCACAAC GTCCGCTTTT TACGAACCGT CGCTTGACTA TGTTTCATTA
AAAGTACCTA GATGGGATTT GAAAAAATTT ACCGGCGTTT CTAAAGAATT AGGCACGCAG
ATGAAGTCCG TGGGTGAAGT TATGTCCATA GGACGCAACT TTTGCGAGGT TGTGCAAAAG
GCCCTTCGCA TGGTGCAGGA AGACGAAGAA GGCTTAATGA AAGAAGTTTT TGCCGGTACG
TCCGATAAAG AGCTTCTTAA AGAAGCCGCG CACCCTACAA ACTTAAGAAT TTTTGCCATT
TATGAACTTT TTAAAAGAGG TTTTAGCGTA GATAAAGTAA AAAATGTTAC CAAGATTGAA
CCTTGGTTTT TAAGCCATTT ATTTTACTTA GCAAAACTTG AAAATGAAGT AGCCACATTT
TTTAAAGGCG TAAAAGCGCC CAAAAAACTT ACGGCCGACT TTATAAAAAA ACAGTTTACC
AATATAGATA CGGAATATTT GAGAAGGTTA AAAAGCAGGG GATTTTCCGA CTACCAGCTT
ACAAAACTTT TACTTTCCGT AATTTCCCCC AAAGAGAAAT TTACGAATAA AGAAATTAAT
TCTTTATCTT TAGGACTTAG GGAATTAAGA AAGAAAATGA ATATAGTGCC CGTGGTTAAA
CAAATTGACA CAACCTCCGC CGAATATTAC ACGACTTCAA ATTATTTGTA TCTTACTTAC
GACGGCACTC ACAATGATAT CACGCTAAAA AAGAAAAACA AAAGTATTAT TACGCTTGGC
AGCGGCAGCT ACCGCATAGG CAGCAGTTTG GAATTTGACT GGTGCTCTGT TATGACAAGC
AAATACTTTA AACAGCAAAA AGACGACAGT ATTATTATTA ACTGCAACCC CGAAACCGTC
TCGACCGACT TTAACAGCTC GGACAGATTA TATTTTGAGG AACTTTCTTT TGAGCGCGTT
ATGGATATTA TTGATTTTGA ATCCCCCAAA GGCGTTGTGG CCTGCATGGG CGGGCAAAAC
CCCAATAACC TTACGCCTTA TTTAAGCAGA GTGGGAGTTA ATATTTTAGG GCACAGTTTT
GAAACCGTTG AAAAAGCGGA AAACAGAACA AAGTTTTCAG CAATACTGGA TTCTTTAAAT
ATAGACCAGC CCAAATGGAC GTCAGCCGCT TCAAGAAAGG AAGTTAATGA CTTTGTTAAA
GAAGTAGGCT TTCCTGTTCT TATAAGACCG AGTTTTGTTT TATCGGGCAC GCTTATGAAC
GTGGCTAACG ACCAAAAATC TTTGGACTAC TATTTGTCGC TTACCAAAGA TATTTCGGCA
GATTACCCTG TGGTGCTGTC CCAGTTTATT TTAGACGCCA AGGAACTTGA GTGCGACGGC
GTTGCCAAAA ACGGCGAGGT GCTGCTTTCC TTTATTTCCG AACACGTTGA AAACGCAGGC
GTTCACAGCG GCGACGCTAC ATTGGTTTTT CCAGCGGAAA AAATTTATAC AAAAACAGCC
AATTCAATTA GAGATATCGT TAGAAAAATA GCTAAAGGGC TTAACCTTAA CGGGCCTTTT
AATATACAGT TTATAGCTAA AGATAACGAC GTAAAAGTAA TTGAGTGTAA CGCGCGCGCT
TCACGCTCGT TCCCGTTTAT AACAAAAGTT TCGGGCCAAA ACCTGGCGGA GTTTTCCTGC
AAAGTCATGA ACAATGAAAA AGTTGATAAA GTGTTTATGG ATGAGTCGGA AATTCCTTAT
ACAGGCGTTA AAGCAAGCAT GTTCAGCTTT CAAAGGCTTG ACGGAGCCGA CCCTATTTTA
GGAGTTGAAA TGGCCTCCAC CGGCGAAGTG GGCTGCATAG GGGCAAATTT TAACGAGGCC
ATGCTTTTGG CTATGGAATC AACGCATATA AAAATGCCAA AAAAAGGTAT TTTACTAAGT
ACTGGCCGTG AAAAAGATAA GATTAAATTT ATGGAAGTGA TTGATAACGT CTATAAATTC
GGTCTGCCTG TTTACGCTAC GCTGGGCACC GCAAATTACC TTAAAGAACA CGGCTATGAT
GCTATACCCG TTATGTACCA CCATGACCCA AAGCCCGTTG ACGTAATAAA ACAACGCAAG
GTGGACTTTG TGGTTAACGT GCATAAAAGC TTGGAACTTG ACGAGCTTGA ACATAACTCG
GCCATAAGAA AAACGGCTGT TAAATCAAAC TGTTCGCTTT TAACAAACCT TGAAAAAGCG
ATAGCTTATT TTAAAGCGTT TGATTCTTAT AAAGCATTAT CTGAAAAAGA CGACTTAATA
CATTTGTAA
 
Protein sequence
MPILNFLKPK NGKKQKVLLL GSGALSIGQA GEFDYSGSQA IKALEEEGLE VIVLNPNIAS 
VQTNPAPNKK IYLYPVTPFW IEKIIKKERP VALIAGFGGQ TSLNCAIELH NNGVLKKYGV
KVLGTPVSSL EMSEDRDLFS KRMHEIGVPT PPSKAVETVE EALKTALEIG YPVITRSAYA
LGGLGSGLAE NPEQLEKLAS SALTSSPQIL IEKSLHGWKE IEYEVMRDAC GNSITICNME
NFDPMGIHTG DSIVIAPCQT LNNRENNMLR DAALNIVKSI GVVGECNVQF ALSPFTLEYY
VIEINARLSR SSALASKATG YPIAFVAAKV VSGFDLLELK NPVTGTTSAF YEPSLDYVSL
KVPRWDLKKF TGVSKELGTQ MKSVGEVMSI GRNFCEVVQK ALRMVQEDEE GLMKEVFAGT
SDKELLKEAA HPTNLRIFAI YELFKRGFSV DKVKNVTKIE PWFLSHLFYL AKLENEVATF
FKGVKAPKKL TADFIKKQFT NIDTEYLRRL KSRGFSDYQL TKLLLSVISP KEKFTNKEIN
SLSLGLRELR KKMNIVPVVK QIDTTSAEYY TTSNYLYLTY DGTHNDITLK KKNKSIITLG
SGSYRIGSSL EFDWCSVMTS KYFKQQKDDS IIINCNPETV STDFNSSDRL YFEELSFERV
MDIIDFESPK GVVACMGGQN PNNLTPYLSR VGVNILGHSF ETVEKAENRT KFSAILDSLN
IDQPKWTSAA SRKEVNDFVK EVGFPVLIRP SFVLSGTLMN VANDQKSLDY YLSLTKDISA
DYPVVLSQFI LDAKELECDG VAKNGEVLLS FISEHVENAG VHSGDATLVF PAEKIYTKTA
NSIRDIVRKI AKGLNLNGPF NIQFIAKDND VKVIECNARA SRSFPFITKV SGQNLAEFSC
KVMNNEKVDK VFMDESEIPY TGVKASMFSF QRLDGADPIL GVEMASTGEV GCIGANFNEA
MLLAMESTHI KMPKKGILLS TGREKDKIKF MEVIDNVYKF GLPVYATLGT ANYLKEHGYD
AIPVMYHHDP KPVDVIKQRK VDFVVNVHKS LELDELEHNS AIRKTAVKSN CSLLTNLEKA
IAYFKAFDSY KALSEKDDLI HL