Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_4018 |
Symbol | |
ID | 5086192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 48398 |
End bp | 49399 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640485576 |
Product | hypothetical protein |
Protein accession | YP_001170176 |
Protein GI | 146280019 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
TIGRFAM ID | [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.416304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGTCA AGGTTGCCAT CAACGGCTTC GGCCGCATCG GCCGCAACGT GCTCCGCGCC ATCGTGGAAT CGGGGCGCAC CGACATCGAG GTGGTCGCGA TCAACGATCT GGGGCCGGTC GAGACCAACG CGCATCTCTT GCGCTTCGAC AGCGTGCACG GCCGCTTCCC GGCCACCGTG ACCACCGGAG AGGACTGGAT CGACGTGGGC CGCGGCCCGA TCAAGGTCAC CGCGATCCGC AACCCGGCCG AGCTGCCCTG GGCGGACGTG GATGTGGCGA TGGAATGCAC CGGCATCTTC ACCACGAAGG AGAAGGCCGC GGCGCACCTT CAGAACGGCT CGAAGCGGGT GCTGGTCTCG GCCCCCTGCG ACGGCGCGGA CAAGACCATC GTCTATGGCG TGAACCACGC CACGCTGACC GCCGGGGACA TGGTCGTGTC GAACGCGTCC TGCACCACGA ACTGCCTCTC GCCGGTCGCC AAGGCGCTCA ACGACGCGAT CGGGATCGCC AAGGGCTTCA TGACCACGAT CCACAGCTAT ACGGGCGACC AGCCGACGCT GGACACGATG CACAAGGATC TCTACCGGGC GCGGGCCGCG GCGCTGAGCA TGATCCCGAC CTCGACCGGG GCGGCGAAGG CGGTGGGCCT CGTGCTGCCG GAACTGAAGG GCAGGCTCGA CGGCGTGGCG ATCCGGGTGC CGACGCCGAA CGTCTCGGTG GTGGATCTCG TGTTCGAGGC CGCGCGCGAC ACCACGGTGG AGGAGGTCAA CGCCGCCATC GAGGCCGCGG CCGACGGCCC GCTGAAGGGC GTGCTGGGCT ACACGAAACA GCCCAATGTC TCGTCCGACT TCAACCACGA CCCGCATTCG TCGGTGTTCC ACATGGACCA GACCAAGGTG ATGGAGGGCC GGATGGTCCG CATCCTCAGC TGGTACGACA ACGAATGGGG CTTCTCGAAC CGGATGGCCG ACACCGCCGT GGCGATGGGC CGGCTTCTCT GA
|
Protein sequence | MTVKVAINGF GRIGRNVLRA IVESGRTDIE VVAINDLGPV ETNAHLLRFD SVHGRFPATV TTGEDWIDVG RGPIKVTAIR NPAELPWADV DVAMECTGIF TTKEKAAAHL QNGSKRVLVS APCDGADKTI VYGVNHATLT AGDMVVSNAS CTTNCLSPVA KALNDAIGIA KGFMTTIHSY TGDQPTLDTM HKDLYRARAA ALSMIPTSTG AAKAVGLVLP ELKGRLDGVA IRVPTPNVSV VDLVFEAARD TTVEEVNAAI EAAADGPLKG VLGYTKQPNV SSDFNHDPHS SVFHMDQTKV MEGRMVRILS WYDNEWGFSN RMADTAVAMG RLL
|
| |