Gene Information |
Locus tag | Tery_3446 |
Symbol | |
ID | 4243390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5269160 |
End bp | 5272141 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638108423 |
Product | L-proline dehydrogenase / delta-1-pyrroline-5-carboxylate dehydrogenase |
Protein accession | YP_723012 |
Protein GI | 113476951 |
COG category | [C] Energy production and conversion; [E] Amino acid transport and metabolism |
COG ID | [COG0506] Proline dehydrogenase; [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative; [TIGR01238] delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain) |
|
|
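The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive (the record does not state the convention, but the reported gene length matches under it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of the record.
length_bp = gene_length(5269160, 5272141)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (2982 bp) and Protein Length (993 aa) rows.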
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGAAC AAATATCCCA TACTAACTAC GAAACTAAAA CTCAAGAAAT AGCTAAACAA CTCTTGAAAG CAACTCAGGA AAAAAATCGT TCTTTCTTGG CTCAATTGCA AAATCAAATG CGGTGGGATG ATAAGCTCCT TGACTGGGCA ATGGCAAGTC CTGGTTTGCG AGTCCAACTA TTTCGTTTTA TTGACTGTTT ACCAGCATTA CGGAGTAAAC CAGAAATAGC TGCTCACCTC CAAGAATATC TGACTACTCA AGAAGTCGAA CTTCCAGAAA CTCTCAAAAA ACTACTCAAT TTTGCTAACC CAGACTCTGT TCCTGGTCAA CTTGCAGCGA CTACTGTTGC ACCTGCAGTG GAAACTCTCG CTCATAAATA TATTGCTGGG GAAAATGTTA AGCAAATCAT CAAAACCCTA GAAAAACTAC GGAAAGATAA AATGTGCTTT ACTGTCGACC TCCTTGGAGA AGCAGTCATC ACTGAAACTG AAGCACAGTT ATATTTAGAC CGTTATCTTG AACTGATGAC TCAACTTTCC CAAGCTGCTA ATAGCTGGTC ATCAATTCCC CAAATAGATG AAGCAGAAGG GGAAAAACTA CCACGAGTAC AAGTATCGGT AAAACTTACA GCATTTTATT CTCAGTTTGA TCCTCTTGAT GTTAAAGGTA GTCAGGAAAG AGTGAGTGAT CATGTACGCA CTCTACTCAG ACATGCTCAA AAACTGGGTG TAGCAGTTCA CTTTGACATG GAACAATATA CTTATAAAGA CCTGACTATT GATATTCTTA AAGAATTGTT GATGGAAGAA GAATTTCGCA ACCGTAATGA TATCGGTGTG ACAATTCAAG CATATTTGCG GGATAGTGAA AAAGATCTGC AAGGCATTAT TGAATGGGCA AAAATTAGAG GTCGTCCCGT TACTGTTCGA CTTGTAAAAG GAGCCTATTG GGATCAGGAA ACTATCAAGG CTTTACAACA TGACTGGCCC CAACCAGTAT TTAATGATAA ACCTGAAACT GATGCCAACT TTGAAAAGCT GACTTATATG ATGCTAGAAA ATCATGAATA TATTTACTCA GCTATTGGTA GTCATAATGT TCGTTCTCAA GCGAGAGCGA TCGCTATTGC TGAAACTCTA AAAGTTCCTC GGCGGTGCTT TGAGATGCAG GTACTCTATG GTATGGGTGA CAAAATAGCT ACAATATTAG CAGAAAAGGG TTATCGCGTC CGAGTTTATT GCCCCTACGG TGATCTTTTA CCAGGAATGG CATATTTAAT TCGTCGCCTC CTAGAAAATA CTGCTAACAG TTCATTTCTC AAACAAAGTT TAGAAAACCG GCCTTTAGAA GAATTATTGG CAGCACCAAG TATTAATGGT AAAACTACAA TTCACGATGT GGTCAAACCA GTTTTCCCTA ATGCTGCCGA TAGCGACTAC GCTAATTTCC AACAACGGCA ACAAGCTTTG AATACTATTG GTCAAATACG ACTAGAGTTA GGAAAAACCT ATCTGCCAAT TATTAATGGT GAATACACAA ACACAGCACA AATAGTTGAT TCACTGAACC CATCAAATTC TAAGGAGGTT GTTGGCAAAA TTGGTTTAAT ATCGGTAGAA CAAGCAGAGC AGGCAATTCA AGCAGCCAAA GCAGCGTTTC CAAGTTGGAA GAAAACACCA GTTCGAGAAC GGACAAGGAT ATTGCGCAAA GCTGCTGACT TAATGGAAAA ACGTCGTCAT GAGTTGGCAG CATGGATGGT TTTAGAGGTT GGTAAACCTC TTAACCAAGG TAATGCAGAA 
GTTTCTGAGG CGATAGATTT TTGTCGTTAT TATGCTGATG AGATGGAACG GTTAGAGCAG GGCTATAATT ATGATGTCGC TGGAGAAACT GACCGTTATT ATTATCAACC TCGTGGTATT TCTTTAGTAA TTTCTCCTTG GAATTTTCCT CTGGCTATTC CTACTGGAAT GACTGTTTCG TCTCTGGTGA CAGGTAATTG TACTTTGTTG AAGCCAGCAG CAGTATCATC GGTTATTGCT TCTAAAATTT CTGAGATTTT GCTAGAAGCT GGTTTTCCTA AGGGGGTATT TCAATTTGTT CCGGGAAATG GTTCCACTGT GGGTGATTTT ATGGTGAAGC ACCCAAGTGT TAATAGTATT ACTTTTACTG GTTCGATGGA AGTGGGTTGT CACATTTATG CTCAAGCTGC AGTTTTGCAA CCAAGGCAAA AACATTTGAA GCGAGTTATT GCTGAAATGG GTGGTAAAAA TGCGATTATT GTAGATGAAA GTGCAGATCT TGACCAAGCA GTTGCTGGTG TTGTTTATTC TGCTTTTGGA TATAGTGGAC AGAAATGTTC GGCTTGTTCG CGGGTTGTAG TATTGGCAAC AATTTATGAT AGTTTTGTGA ATCGTTTGGT GGAGGCTACG CGATCGCTCA ATATAGGGGA TGCGGAGAAG CCAAGTACAC AAGTTGGTCC GGTAATTGAT GAGAAAGCGC AAAAGCGGAT TCAGGAATAT ATTGTGAAAG GTAAGCAGGA AGCAGAAGTT GCTATTGAGA TGAGTGCGCC GAAGGATGGT TATTTTGTTG GTCCTGTGAT ATTTAAGGAT GTGTCGCCAA GTGCAACTAT TGCTCAGGAA GAAATATTTG GTCCTGTGTT GGCAGTAATG AAGGCGGATA ATTTTACTCA AGCTTTGGAA ATTGCTAATG GGACTAATTT TGCTTTAACT GGGGGGTTAT ATTCCCGTAC TCCTTCTCAT ATTGAACAAG CAAAAGCAGA GTTTGAAGTG GGGAATTTAT ATATTAATCG TGGAATTACT GGAGCGATCG TTTCTCGACA ACCATTTGGT GGGTTTAAGT TGTCTGGGGT GGGTTCAAAG GCGGGAGGAC CGGATTATTT ACTGCAGTTT TTGGAACCGC GCACGATAAC TGAGAATATA CAAAGGCAAG GTTTTGCACC TATAGAGGGT GTGGATAATT GA
Protein sequence | MVEQISHTNY ETKTQEIAKQ LLKATQEKNR SFLAQLQNQM RWDDKLLDWA MASPGLRVQL FRFIDCLPAL RSKPEIAAHL QEYLTTQEVE LPETLKKLLN FANPDSVPGQ LAATTVAPAV ETLAHKYIAG ENVKQIIKTL EKLRKDKMCF TVDLLGEAVI TETEAQLYLD RYLELMTQLS QAANSWSSIP QIDEAEGEKL PRVQVSVKLT AFYSQFDPLD VKGSQERVSD HVRTLLRHAQ KLGVAVHFDM EQYTYKDLTI DILKELLMEE EFRNRNDIGV TIQAYLRDSE KDLQGIIEWA KIRGRPVTVR LVKGAYWDQE TIKALQHDWP QPVFNDKPET DANFEKLTYM MLENHEYIYS AIGSHNVRSQ ARAIAIAETL KVPRRCFEMQ VLYGMGDKIA TILAEKGYRV RVYCPYGDLL PGMAYLIRRL LENTANSSFL KQSLENRPLE ELLAAPSING KTTIHDVVKP VFPNAADSDY ANFQQRQQAL NTIGQIRLEL GKTYLPIING EYTNTAQIVD SLNPSNSKEV VGKIGLISVE QAEQAIQAAK AAFPSWKKTP VRERTRILRK AADLMEKRRH ELAAWMVLEV GKPLNQGNAE VSEAIDFCRY YADEMERLEQ GYNYDVAGET DRYYYQPRGI SLVISPWNFP LAIPTGMTVS SLVTGNCTLL KPAAVSSVIA SKISEILLEA GFPKGVFQFV PGNGSTVGDF MVKHPSVNSI TFTGSMEVGC HIYAQAAVLQ PRQKHLKRVI AEMGGKNAII VDESADLDQA VAGVVYSAFG YSGQKCSACS RVVVLATIYD SFVNRLVEAT RSLNIGDAEK PSTQVGPVID EKAQKRIQEY IVKGKQEAEV AIEMSAPKDG YFVGPVIFKD VSPSATIAQE EIFGPVLAVM KADNFTQALE IANGTNFALT GGLYSRTPSH IEQAKAEFEV GNLYINRGIT GAIVSRQPFG GFKLSGVGSK AGGPDYLLQF LEPRTITENI QRQGFAPIEG VDN
| |
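The sequences in this record are displayed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The following is a minimal sketch of that cleanup together with a GC-content calculation; the helper names `normalize_sequence` and `gc_percent` are hypothetical, and the demo input is only the first three blocks of the gene sequence, not the full 2982 bp.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks of ten."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence from the record, used as a short demo.
sample = "ATGGTTGAAC AAATATCCCA TACTAACTAC"
dna = normalize_sequence(sample)
print(len(dna), round(gc_percent(dna), 1))
```

Applied to the complete gene sequence, the normalized length can be compared against the Gene Length row and the GC percentage against the GC content row.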