Gene Tery_5031 details


Gene Information

Locus tag: Tery_5031
Symbol:
ID: 4246686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7687850
End bp: 7690774
Gene Length: 2925 bp
Protein Length: 974 aa
Translation table: 11
GC content: 40%
IMG OID: 638109840
Product: glycine dehydrogenase
Protein accession: YP_724416
Protein GI: 113478355
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
        [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain
TIGRFAM ID: [TIGR00461] glycine dehydrogenase (decarboxylating)
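
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 7687850
END_BP = 7690774


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Protein length in residues, assuming an unspliced CDS whose
    final codon is a stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2925
print(protein_length(length_bp))  # 974
```

Under these assumptions the computed values agree with the listed gene length (2925 bp) and protein length (974 aa).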


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAGCAA CATATAAATC TAATATACAG TCAAGTTATC AAATACAACT GGCCAACCAA 
AACCAAGGTC AGAGGCCAAT AGACTTTTCC CAACGACATA TTGGTCTTAC CTCATCAGAA
ATCCAACAAA TGCTGGAAGT ATTAGGTATT TCCTCCCTAG AAGACTTAAT TGACAAAACA
GTTCCCGAAA AAATTCGATT CCAGAAACCA CTCAATTTGC CCAAGTCTCT GAGTGAAAAT
GCGGCACTTG CTCAAATTAA AGAAATAATC TCTAAAAATC AGATATTTCG TTCCTTTATT
GGAATGGGTT ATTATGACTG CATTACCCCA CCAGTTATCC TTCGCAATAT ACTAGAAAAC
CCTGGTTGGT ACACAGCTTA TACTCCCTAT CAAGCAGAAA TAGCTCAAGG CCGGATGGAG
GCGTTGCTGA ATTTTCAAAC CATGATTACA GACTTAACAG GTTTAGAAAT AGCTAATGCT
TCACTACTTG ATGAAGCTAC AGCAGCAGCG GAAGCGATGA GCATGACTTA TGGTTTATGT
AAAACTAAAG CCGAAGTTTT CTTTGTTGAC TCTGCCTGCC ATCCTCAGAA TATTGAAGTT
GTCAAAACTA GGGCACAACC ATTGGGAATA GAAGTAATAG TCGGGGACTT CCGAACCTTC
ACTTTTGACA AACCAATTTT TGGTGCACTC CTGCAATATC CCGCCACTAA TGGAGCAATT
TATGACTATC GCGAATTTGT GGAAAAAGTT CACAAGGTAG GAGGTTTAGT AACTGTTGCT
GCTGAATTAC TAAGTTTAAC CTTACTCACA CCCCCTGGAG AATTTGGTGC TGATATTGCT
GTAGGTAATA CTCAACGTTT TGGCGTGTCT CTTGGATACG GAGGCCCCCA TGCTGCTTAC
TTTGCTACTA AAGAAGCTTA TAAACGACAA ACTCCTGGAC GTATTGTTGG GGTTTCTCAA
GATGCTAATG GTAACCCAGC CTTACGTTTA GCACTACAAA CCAGGGAACA GCATATTCGC
CGGGAAAAGG CAACTAGCAA TATTTGTACT GCTCAAGTTT TATTGGCAGT AATAGCAGGT
ATGTACGCAG TCTATCACGG TCCGGGTGGT TTAAAACAAA TTGCAGAAAA TATCCATAAT
TTGACTTTTA AGTTGGCAAC AGGTTTAAAA CAGCTTGGTT ATCAAATTGG TGCAGAGTTA
TTTTTTGATA CTATTGAGAT CAAATTGGGT GCTGACTCTC CTGTGAAAAG TGCAAAAGAA
ATTATTGATG CGGCTGAAAA TTTAGGTATT AATCTCCGAA CTTTTGATGA ACAAACGGTT
GGTATTTCTC TAGATGAAAC TACCACTGAA GTAGATGTAC AAAATCTGTG GCAAATTTTT
GCTAGTGGAG AAAAGTTCCC AAACATCGAA AATGAAAATA TTTCTACCCT TTCTCAAAGT
TATTATGCTC GCACTAGCAA TTATCTAACT CACCCAGTAT TTAAAAGTTA TCATTCTGAA
ACCAATCTTT TGCGTTATAT TCATCGTTTG CAGTCTAAAG ATTTATCTTT GACAACATCA
ATGATACCTT TGGGTTCCTG TACAATGAAA CTTAATGCTA CAGCAGAAAT GATACCTGTA
ACTTGGCCTG AATTTGCCAA TATTCACCCT TTTTCTCCCA TTTCTCAAAC TCAGGGTTAT
CAAATAATAT TTCAGCAATT AGAAGAATGG TTGGCAGAAA TCACAGGTTT TGCAGAAATT
TCTCTACAAC CAAACGCAGG TTCACAAGGA GAATATACGG GTTTATTAGT AATTCGCGAA
TATCATGCTC ACCGTGGGGA AGCACATCGT GATATTTGTT TGATCCCTGA ATCTGCCCAC
GGTACAAACC CTGCTTCTGC AGTAATGAGT GGGTTAAAGG TTGTGGTTGT TAAATGTGAT
GCCCAAGGCA ATATAGATAT TGCAGATTTA CAGACAAAGG CAGAAAAGCA TAAAGATAAT
TTAGCTGCAA TAATGATTAC ATACCCCTCT ACTCACGGTG TCTTTGAGGA AGAAATTCTT
GATATTTGTG AAATTATCCA TGCTCATGGT GGGCAAGTTT ATATGGATGG GGCAAATATG
AATGCTCAAG TAGGATTATG TCGTCCCGCG GAAATAGGTG CTGATGTTTG TCATTTGAAT
TTACATAAAA CCTTTTGTAT TCCTCATGGT GGCGGAGGTC CAGGAATGGG TCCGATAGGG
GTTAAGTCTC ACTTGGCACC GTTTTTACCA GGGCATTCTG TTATTAATTT GGGAGGGGAA
AATTCTAGTG GAGCTGTATC TGCTGCACCC TGGGGTAGCG CTAGTATTCT GCCTATTTCT
TGGATGTATA TTGCGATGAT GGGGACGGAT GGTTTGACTG AAGCAACTAA GATAGCGATT
TTGAATGCTA ATTATATTGC CCAACGTTTG GGAAGTTATT ATTCAGTTTT GTACAAGGGT
AAGTATGGGT TTATTGCTCA CGAGTGCATT TTGGATTTGC GTCCTTTGAA AAAGTTGGCT
GGTATTGAGG TGGAGGATAT TGCTAAACGT TTGATGGACT ATGGTTTTCA TGCGCCGACT
GTCTCTTGGC CTGTGGCGGG TACAATTATG GTGGAACCGA CAGAGAGTGA GTCTAAGGAT
GAGTTAGACC GTTTTTGTGA CGCGATGATT TCTATTCGTC AGGAAATAGA GGAGATCGAA
ACTGGTAAGG CAGATAAAAA TGATAATTTG TTGAAAAATG CGCCTCATAC TGCTGAGAGT
TTGATGGTGG ATGAGTGGAA GCATGGTTAT TCTCGACAAC GTGCTGCTTA TCCTGCGCCT
TGGACGCGAG AGCATAAATT TTGGCCTGCT GTAGGACGGG TTGATAATGC TTTTGGGGAT
CGCAATTTTG TTTGTTCTTG TTTGCCGATA GAGGCTTACA GTTAA
 
Protein sequence
MVATYKSNIQ SSYQIQLANQ NQGQRPIDFS QRHIGLTSSE IQQMLEVLGI SSLEDLIDKT 
VPEKIRFQKP LNLPKSLSEN AALAQIKEII SKNQIFRSFI GMGYYDCITP PVILRNILEN
PGWYTAYTPY QAEIAQGRME ALLNFQTMIT DLTGLEIANA SLLDEATAAA EAMSMTYGLC
KTKAEVFFVD SACHPQNIEV VKTRAQPLGI EVIVGDFRTF TFDKPIFGAL LQYPATNGAI
YDYREFVEKV HKVGGLVTVA AELLSLTLLT PPGEFGADIA VGNTQRFGVS LGYGGPHAAY
FATKEAYKRQ TPGRIVGVSQ DANGNPALRL ALQTREQHIR REKATSNICT AQVLLAVIAG
MYAVYHGPGG LKQIAENIHN LTFKLATGLK QLGYQIGAEL FFDTIEIKLG ADSPVKSAKE
IIDAAENLGI NLRTFDEQTV GISLDETTTE VDVQNLWQIF ASGEKFPNIE NENISTLSQS
YYARTSNYLT HPVFKSYHSE TNLLRYIHRL QSKDLSLTTS MIPLGSCTMK LNATAEMIPV
TWPEFANIHP FSPISQTQGY QIIFQQLEEW LAEITGFAEI SLQPNAGSQG EYTGLLVIRE
YHAHRGEAHR DICLIPESAH GTNPASAVMS GLKVVVVKCD AQGNIDIADL QTKAEKHKDN
LAAIMITYPS THGVFEEEIL DICEIIHAHG GQVYMDGANM NAQVGLCRPA EIGADVCHLN
LHKTFCIPHG GGGPGMGPIG VKSHLAPFLP GHSVINLGGE NSSGAVSAAP WGSASILPIS
WMYIAMMGTD GLTEATKIAI LNANYIAQRL GSYYSVLYKG KYGFIAHECI LDLRPLKKLA
GIEVEDIAKR LMDYGFHAPT VSWPVAGTIM VEPTESESKD ELDRFCDAMI SIRQEIEEIE
TGKADKNDNL LKNAPHTAES LMVDEWKHGY SRQRAAYPAP WTREHKFWPA VGRVDNAFGD
RNFVCSCLPI EAYS
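
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which is not relevant here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base
# varies slowest). Translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping
    at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 nt, 20 codons).
first_row = "ATGGTAGCAA CATATAAATC TAATATACAG TCAAGTTATC AAATACAACT GGCCAACCAA"
print(translate(first_row))  # MVATYKSNIQSSYQIQLANQ
```

The output matches the first 20 residues of the protein sequence; passing the full gene sequence would be expected to give the complete protein in the same way.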