Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1338 |
Symbol | |
ID | 4070627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1616987 |
End bp | 1617907 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637983347 |
Product | L-proline dehydrogenase |
Protein accession | YP_590414 |
Protein GI | 94968366 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0506] Proline dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAGAG CCGTCTTTAT CGGGCTATCC GAGAACAAAT CCCTGCGTCA TTTTGCCGAG AGCTCGGCGA TGGGCCGCCG CATGTCATCG CGCTTTGCGG CTGGACTGGA AGTAAAAGAT GCCGTCGCGG CAGCGAAGAA GCTGAATGCG ATGGGCGCCA CCGTGAGCAT CGATAACCTC GGCGAGAACG TGACCAACGC CGCCGAAGCA CGCGAGAGTG CAAAGCTCTA TCACGACATG CTCGATGAGC TTTCGAAGGA CGGACTGAAG GCGAACATCA GTTTGAAGCT GACCCACATG GGACTGGATG TGGACCCGGC GCTTTGCCAC GACCTGGTCT CCGAGCTGGT GGACCACGCG GTGAAGATCG ACAATTTCGT CCGCATTGAT ATGGAAGGCT CACCCTACAC GCAAAAGACA CTCGACTTCG TACATGAGCT GCATCGGATT CCTGGGCACT CGGGACATGT GGGCGCGGTG ATCCAGTCTT ATATGCGGCG CGCAGAAGCT GACGTCGAGA AGCTGTTGTC GGAGCGGATT CGCATACGCC TGTGCAAGGG AGCTTACAAA GAGCCGGCGG ACATCGCATT CCAGCAGAAG TCGGAAGTGG ATGCGAACTA TGTGAAGCTC GCCAAGACAC TGATCAAGAG CGGCGTGTAT CACGGGATCG CGACGCACGA TGAGAACATC ATCAACGAAC TGAAACAGTT CGCGAAGGCA GAGAGCATTC CGGCTTCGGC GTTTGAGTTC CAGATGCTTT ATGGCGTGCG GCGCGGCTTG CAGGAACAGC TGATTAAAGA AGGCTGGGGA CTGCGGGTGT ACGTACCATT CGGGACGGAG TGGTATCCGT ACTTTATGCG GCGACTGGCG GAACGTCCGG CGAACGCGTT GTTTATCGCT AAGAATATGT TCCGGAATTA G
|
Protein sequence | MLRAVFIGLS ENKSLRHFAE SSAMGRRMSS RFAAGLEVKD AVAAAKKLNA MGATVSIDNL GENVTNAAEA RESAKLYHDM LDELSKDGLK ANISLKLTHM GLDVDPALCH DLVSELVDHA VKIDNFVRID MEGSPYTQKT LDFVHELHRI PGHSGHVGAV IQSYMRRAEA DVEKLLSERI RIRLCKGAYK EPADIAFQQK SEVDANYVKL AKTLIKSGVY HGIATHDENI INELKQFAKA ESIPASAFEF QMLYGVRRGL QEQLIKEGWG LRVYVPFGTE WYPYFMRRLA ERPANALFIA KNMFRN
|
| |