Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0555 |
Symbol | |
ID | 4710563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 628541 |
End bp | 631708 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639855013 |
Product | bifunctional proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase |
Protein accession | YP_001002143 |
Protein GI | 121997356 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0506] Proline dehydrogenase [COG4230] Delta 1-pyrroline-5-carboxylate dehydrogenase |
TIGRFAM ID | [TIGR01238] delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.637208 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGT TCGTCCACCC GGAGCCCGAG CAGCTGCTGC CCGCCGAGCG CCGCGCCCTG GCCGCGGCCT ACCGCATCGA TGAACCCACC CGCAGCCGCG CCCTGCTTGA GGAGGCCGAC TGCGATCCGG CGACCCGCTC CCGTATCCAG GAGCGCGCCC GGGGCCTGGT CCACGGCATG ATCCGCGCCC AGCGCCGCCA GTCCTCGCTG ACCGCCCTGC TCCACGAGTA CGACCTCTCC AGCGAGGAAG GAGTGGCGCT GATGTGCCTG GCGGAGGCAT TGCTGCGTAT CCCCGACGCG CCGACTGCCG ATCAGCTGAT CCACGACAAG CTCGGCACCG GGGGCTGGTC GTCGCACCTG GGGCGCGACC GGCGGTTGCT GGTCAACGCC GCCACCCTCG GCCTGGCCCT GACCGGCCGG ATCCTGGACA CCCGCGACGC CGAGCGCTGG TTTGGAGATC GCCTCCACAC CGCCATCGCC CGTCGCGGGG CCCCGCTGAT CCGCCGGGCG GTGCGGCGCT CGATGGGACT GCTTGGTGAG ACCTTCGTCC TCGGCCGCGA CATCCCCGAG GCGCAGCGGC GCGCCCGCAA ACTCGAAGCC AAGGGGTACC GCTACTCCTA CGACATGCTC GGCGAGGCGG CGCGCACCGA GGCCGACGCC GAGCACTTCT TCCAGGCCTA CTGCCGCGGC ATCGAGCACT TCGGGCGCAG TGCCGACCCG GATGCGCCCA TGGACGCCCG GGCCGAGGTC TCGGTGAAGC TCTCGGCCCT CGACCCGCGC TTCGAGCCCG GTCAGGAGGA ACGCGTCCAG GCCACGGTCA TCCCCCGCTT GCAGGCGCTG TGCCGGCGCG CCCGGGAGGC CGGCATCGCC CTATGCGTGG ACGCCGAGGA GGCCGCCCGG ATCGACCTGA CACTGGATGT GCTTGAGGCG GTGATGGCCG ACCCGGAACT CGCCGACTGG GACGGGCTGG GAATTGCGGT GCAGGCGTAC CAGAAGCGCG CCCCGGAGTG GATCGACTGG CTGGCCGAGC GGGCCGGGCA CTACCGGCGC CGCCTGCGCA TCCGGCTGGT CAAGGGCGCC TACTGGGATA CCGAGATCAA GGACTCACAG ATCCAGGGCC TGGACGACTA CCCGGTCTTC ACCCGCAAGG CGGCCAGCGA TGTCTGCTTC CTGGCCTGCG CCCGGCGCAT GCTCCGCCAC CCGCAGCAGA TCTATCCGCA GTTCGCCACC CACAACGCCC ACACCGTCGC CGCGGTGATG GAACTGGCCG ACGAGCAGCC CTTCGAGTTT CAGCGCCTGC ACGGCATGGC TGATGACCTC TACGATCAAC TGGTCGACGC CCGCCCCGGC CGCGGGGTGC CGGTGCGCAT CTACGCCCCG GTGGGCCAGC ACGAGGCGCT GCTGCCCTAT CTCGTCCGCC GGCTGCTGGA GAACGGCGCC AACTCCTCCT TCGTCAATCG GATCCACGAA GGCGACGTGG AAGAGCTGAT CGCCGACCCG GTGGAGCACC TGCGCTCGCG GACGACCCTG CGCCACCCGC ACCTGCCGCT GCCGTCCGGG ATTTTCGGCC CGGAGCGGGT CAACAGCCGC GGTATCGACT TCTCCAACCG GCAAGAGACC GCCGCCCTGG CTGCTGCCAT GACCACCGCC GCCGAGCCTG CCCGCGAGGC GCGGCCGATC ATCAACGGCC AGGGCGCCAC CGAGGCCGAC GGCACCTGGG CCGAGGTCTG CAGCCCCACG GATACCGCCC AGCACGTCGG CCGGGTGCTG TGGGCCGGCC ACGAGCACCT GGAGCAGGCC CTAGCCAGTG CCGCAGCGGC CTGGCCGCGC TGGGCCGCGA CGCCGGTGGA CGAGCGCGCC CGAGCCCTGG AGCGCCTCGC CGACCTCTAC GAGGCGCACA CCGCCGAGCT GATGACCCTG TGCACCCTGG AAGGCGGCAA GACCCTTAAG GACGGCATCG CCGAGGTGCG CGAGGCGGTG GACTTCTGCC GCTACTACGC CGTCCAGGCG CGGCGGCTGA TGGGCGAACC GACCCCGTTA CCCGGCCCCA CCGGCGAGAC CAACGCCCTG CAGCTGCACG GCCGCGGCAC CTACCTGTGC ATCAGCCCCT GGAACCTGCC GCTGGCGATC TTCACCGGGC AGATCACCGC CGCCCTGGCC GCCGGCAATG CGGTGATCGC CAAGCCGGCG GAGCAGACGC CGCTGATCGC CCACCGAGCC GTGGAGCTCA TGCATCAGGC GGGCATCCCC GGTGACGTGC TCCACCTTCT GCCCGGCGAG GGCCGGCGCA TCGGGCCGCC GCTGGTGGCC GATCGGCGGA TCGACGGGGT CGCCTTCACC GGCTCGGTGG CCACGGCGCA GCAGATCCAC CGCACCCTGG CCGAGCGCGA TGGCCCCATC GTTCCGCTGA TCGCCGAGAC CGGCGGCCTC AACGCCCTGA TCGTCGACTC CAGCGCCCTG CCCGAGCAGG CGGTGGTGGA TGTACTGCGC TCGGCGTTCT TCAGCGCCGG CCAGCGCTGC TCGGCGCTGC GCCTGCTGTG CATCCAGGAG GACATCGCTG AGCCGTTCCT GGCGATGCTG CGCGGGGCCA TGGACGCCCT GCGGGTTGGT GATCCGCGCT GGCTGGCCAC CGACGTCGGC CCGGTGATCG ACAGCGACGC CCGGGCGCGG CTGGAGGCCC ATCACGAGGC GATGGCCGCC GCCGGCCGCG TGGTCCATCG GACACCGCTC GGCCAGGCCG GAGAGCGCGG GCACTTTGTG CCGCCGTCGC TCTACCGGCT GGATGCCATC GAGGATCTGC AGGAGGAGTT CTTCGGGCCG ATGCTCCACT ACACCACCTG GCGGGCGGGG GAGCTCGACT CGGTGGTGGA GCGGATCAAC GCCGCCGGCT ACGGTCTCAC CTTCGGCGTC CACAGCCGCA TCGACAGCCA CCGGGAGATG GCCACGCGCT CGATCCGCGC CGGAAACGCC TACGTCAACC GGGACATCGT CGGCGCGGTG GTGGGCTCCC AGCCCTTCGG CGGCGAGGGG CTGTCGGGTA CCGGGTTCAA GGCCGGCGGA CCGAACTATC TGCTGCGCTT CGTCAACGAA CGGGTGGTCA CCGAGAACAC GGCGGCGGCC GGCGGCAACG CCTCCCTCTT CGCCTTGGGG GAGGACGACG AGGCGTAA
|
Protein sequence | MSAFVHPEPE QLLPAERRAL AAAYRIDEPT RSRALLEEAD CDPATRSRIQ ERARGLVHGM IRAQRRQSSL TALLHEYDLS SEEGVALMCL AEALLRIPDA PTADQLIHDK LGTGGWSSHL GRDRRLLVNA ATLGLALTGR ILDTRDAERW FGDRLHTAIA RRGAPLIRRA VRRSMGLLGE TFVLGRDIPE AQRRARKLEA KGYRYSYDML GEAARTEADA EHFFQAYCRG IEHFGRSADP DAPMDARAEV SVKLSALDPR FEPGQEERVQ ATVIPRLQAL CRRAREAGIA LCVDAEEAAR IDLTLDVLEA VMADPELADW DGLGIAVQAY QKRAPEWIDW LAERAGHYRR RLRIRLVKGA YWDTEIKDSQ IQGLDDYPVF TRKAASDVCF LACARRMLRH PQQIYPQFAT HNAHTVAAVM ELADEQPFEF QRLHGMADDL YDQLVDARPG RGVPVRIYAP VGQHEALLPY LVRRLLENGA NSSFVNRIHE GDVEELIADP VEHLRSRTTL RHPHLPLPSG IFGPERVNSR GIDFSNRQET AALAAAMTTA AEPAREARPI INGQGATEAD GTWAEVCSPT DTAQHVGRVL WAGHEHLEQA LASAAAAWPR WAATPVDERA RALERLADLY EAHTAELMTL CTLEGGKTLK DGIAEVREAV DFCRYYAVQA RRLMGEPTPL PGPTGETNAL QLHGRGTYLC ISPWNLPLAI FTGQITAALA AGNAVIAKPA EQTPLIAHRA VELMHQAGIP GDVLHLLPGE GRRIGPPLVA DRRIDGVAFT GSVATAQQIH RTLAERDGPI VPLIAETGGL NALIVDSSAL PEQAVVDVLR SAFFSAGQRC SALRLLCIQE DIAEPFLAML RGAMDALRVG DPRWLATDVG PVIDSDARAR LEAHHEAMAA AGRVVHRTPL GQAGERGHFV PPSLYRLDAI EDLQEEFFGP MLHYTTWRAG ELDSVVERIN AAGYGLTFGV HSRIDSHREM ATRSIRAGNA YVNRDIVGAV VGSQPFGGEG LSGTGFKAGG PNYLLRFVNE RVVTENTAAA GGNASLFALG EDDEA
|
| |