Gene Tery_3446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3446 
Symbol 
ID4243390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5269160 
End bp5272141 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content40% 
IMG OID638108423 
ProductL-proline dehydrogenase / delta-1-pyrroline-5-carboxylate dehydrogenase 
Protein accessionYP_723012 
Protein GI113476951 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0506] Proline dehydrogenase
[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative
[TIGR01238] delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGAAC AAATATCCCA TACTAACTAC GAAACTAAAA CTCAAGAAAT AGCTAAACAA 
CTCTTGAAAG CAACTCAGGA AAAAAATCGT TCTTTCTTGG CTCAATTGCA AAATCAAATG
CGGTGGGATG ATAAGCTCCT TGACTGGGCA ATGGCAAGTC CTGGTTTGCG AGTCCAACTA
TTTCGTTTTA TTGACTGTTT ACCAGCATTA CGGAGTAAAC CAGAAATAGC TGCTCACCTC
CAAGAATATC TGACTACTCA AGAAGTCGAA CTTCCAGAAA CTCTCAAAAA ACTACTCAAT
TTTGCTAACC CAGACTCTGT TCCTGGTCAA CTTGCAGCGA CTACTGTTGC ACCTGCAGTG
GAAACTCTCG CTCATAAATA TATTGCTGGG GAAAATGTTA AGCAAATCAT CAAAACCCTA
GAAAAACTAC GGAAAGATAA AATGTGCTTT ACTGTCGACC TCCTTGGAGA AGCAGTCATC
ACTGAAACTG AAGCACAGTT ATATTTAGAC CGTTATCTTG AACTGATGAC TCAACTTTCC
CAAGCTGCTA ATAGCTGGTC ATCAATTCCC CAAATAGATG AAGCAGAAGG GGAAAAACTA
CCACGAGTAC AAGTATCGGT AAAACTTACA GCATTTTATT CTCAGTTTGA TCCTCTTGAT
GTTAAAGGTA GTCAGGAAAG AGTGAGTGAT CATGTACGCA CTCTACTCAG ACATGCTCAA
AAACTGGGTG TAGCAGTTCA CTTTGACATG GAACAATATA CTTATAAAGA CCTGACTATT
GATATTCTTA AAGAATTGTT GATGGAAGAA GAATTTCGCA ACCGTAATGA TATCGGTGTG
ACAATTCAAG CATATTTGCG GGATAGTGAA AAAGATCTGC AAGGCATTAT TGAATGGGCA
AAAATTAGAG GTCGTCCCGT TACTGTTCGA CTTGTAAAAG GAGCCTATTG GGATCAGGAA
ACTATCAAGG CTTTACAACA TGACTGGCCC CAACCAGTAT TTAATGATAA ACCTGAAACT
GATGCCAACT TTGAAAAGCT GACTTATATG ATGCTAGAAA ATCATGAATA TATTTACTCA
GCTATTGGTA GTCATAATGT TCGTTCTCAA GCGAGAGCGA TCGCTATTGC TGAAACTCTA
AAAGTTCCTC GGCGGTGCTT TGAGATGCAG GTACTCTATG GTATGGGTGA CAAAATAGCT
ACAATATTAG CAGAAAAGGG TTATCGCGTC CGAGTTTATT GCCCCTACGG TGATCTTTTA
CCAGGAATGG CATATTTAAT TCGTCGCCTC CTAGAAAATA CTGCTAACAG TTCATTTCTC
AAACAAAGTT TAGAAAACCG GCCTTTAGAA GAATTATTGG CAGCACCAAG TATTAATGGT
AAAACTACAA TTCACGATGT GGTCAAACCA GTTTTCCCTA ATGCTGCCGA TAGCGACTAC
GCTAATTTCC AACAACGGCA ACAAGCTTTG AATACTATTG GTCAAATACG ACTAGAGTTA
GGAAAAACCT ATCTGCCAAT TATTAATGGT GAATACACAA ACACAGCACA AATAGTTGAT
TCACTGAACC CATCAAATTC TAAGGAGGTT GTTGGCAAAA TTGGTTTAAT ATCGGTAGAA
CAAGCAGAGC AGGCAATTCA AGCAGCCAAA GCAGCGTTTC CAAGTTGGAA GAAAACACCA
GTTCGAGAAC GGACAAGGAT ATTGCGCAAA GCTGCTGACT TAATGGAAAA ACGTCGTCAT
GAGTTGGCAG CATGGATGGT TTTAGAGGTT GGTAAACCTC TTAACCAAGG TAATGCAGAA
GTTTCTGAGG CGATAGATTT TTGTCGTTAT TATGCTGATG AGATGGAACG GTTAGAGCAG
GGCTATAATT ATGATGTCGC TGGAGAAACT GACCGTTATT ATTATCAACC TCGTGGTATT
TCTTTAGTAA TTTCTCCTTG GAATTTTCCT CTGGCTATTC CTACTGGAAT GACTGTTTCG
TCTCTGGTGA CAGGTAATTG TACTTTGTTG AAGCCAGCAG CAGTATCATC GGTTATTGCT
TCTAAAATTT CTGAGATTTT GCTAGAAGCT GGTTTTCCTA AGGGGGTATT TCAATTTGTT
CCGGGAAATG GTTCCACTGT GGGTGATTTT ATGGTGAAGC ACCCAAGTGT TAATAGTATT
ACTTTTACTG GTTCGATGGA AGTGGGTTGT CACATTTATG CTCAAGCTGC AGTTTTGCAA
CCAAGGCAAA AACATTTGAA GCGAGTTATT GCTGAAATGG GTGGTAAAAA TGCGATTATT
GTAGATGAAA GTGCAGATCT TGACCAAGCA GTTGCTGGTG TTGTTTATTC TGCTTTTGGA
TATAGTGGAC AGAAATGTTC GGCTTGTTCG CGGGTTGTAG TATTGGCAAC AATTTATGAT
AGTTTTGTGA ATCGTTTGGT GGAGGCTACG CGATCGCTCA ATATAGGGGA TGCGGAGAAG
CCAAGTACAC AAGTTGGTCC GGTAATTGAT GAGAAAGCGC AAAAGCGGAT TCAGGAATAT
ATTGTGAAAG GTAAGCAGGA AGCAGAAGTT GCTATTGAGA TGAGTGCGCC GAAGGATGGT
TATTTTGTTG GTCCTGTGAT ATTTAAGGAT GTGTCGCCAA GTGCAACTAT TGCTCAGGAA
GAAATATTTG GTCCTGTGTT GGCAGTAATG AAGGCGGATA ATTTTACTCA AGCTTTGGAA
ATTGCTAATG GGACTAATTT TGCTTTAACT GGGGGGTTAT ATTCCCGTAC TCCTTCTCAT
ATTGAACAAG CAAAAGCAGA GTTTGAAGTG GGGAATTTAT ATATTAATCG TGGAATTACT
GGAGCGATCG TTTCTCGACA ACCATTTGGT GGGTTTAAGT TGTCTGGGGT GGGTTCAAAG
GCGGGAGGAC CGGATTATTT ACTGCAGTTT TTGGAACCGC GCACGATAAC TGAGAATATA
CAAAGGCAAG GTTTTGCACC TATAGAGGGT GTGGATAATT GA
 
Protein sequence
MVEQISHTNY ETKTQEIAKQ LLKATQEKNR SFLAQLQNQM RWDDKLLDWA MASPGLRVQL 
FRFIDCLPAL RSKPEIAAHL QEYLTTQEVE LPETLKKLLN FANPDSVPGQ LAATTVAPAV
ETLAHKYIAG ENVKQIIKTL EKLRKDKMCF TVDLLGEAVI TETEAQLYLD RYLELMTQLS
QAANSWSSIP QIDEAEGEKL PRVQVSVKLT AFYSQFDPLD VKGSQERVSD HVRTLLRHAQ
KLGVAVHFDM EQYTYKDLTI DILKELLMEE EFRNRNDIGV TIQAYLRDSE KDLQGIIEWA
KIRGRPVTVR LVKGAYWDQE TIKALQHDWP QPVFNDKPET DANFEKLTYM MLENHEYIYS
AIGSHNVRSQ ARAIAIAETL KVPRRCFEMQ VLYGMGDKIA TILAEKGYRV RVYCPYGDLL
PGMAYLIRRL LENTANSSFL KQSLENRPLE ELLAAPSING KTTIHDVVKP VFPNAADSDY
ANFQQRQQAL NTIGQIRLEL GKTYLPIING EYTNTAQIVD SLNPSNSKEV VGKIGLISVE
QAEQAIQAAK AAFPSWKKTP VRERTRILRK AADLMEKRRH ELAAWMVLEV GKPLNQGNAE
VSEAIDFCRY YADEMERLEQ GYNYDVAGET DRYYYQPRGI SLVISPWNFP LAIPTGMTVS
SLVTGNCTLL KPAAVSSVIA SKISEILLEA GFPKGVFQFV PGNGSTVGDF MVKHPSVNSI
TFTGSMEVGC HIYAQAAVLQ PRQKHLKRVI AEMGGKNAII VDESADLDQA VAGVVYSAFG
YSGQKCSACS RVVVLATIYD SFVNRLVEAT RSLNIGDAEK PSTQVGPVID EKAQKRIQEY
IVKGKQEAEV AIEMSAPKDG YFVGPVIFKD VSPSATIAQE EIFGPVLAVM KADNFTQALE
IANGTNFALT GGLYSRTPSH IEQAKAEFEV GNLYINRGIT GAIVSRQPFG GFKLSGVGSK
AGGPDYLLQF LEPRTITENI QRQGFAPIEG VDN