Gene Rsph17029_2850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2850 
Symbol 
ID4897487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp3008402 
End bp3010369 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content67% 
IMG OID640113453 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001044724 
Protein GI126463610 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTTAGAA GACAGGTCGG ACACGGGGCG CGACAAACGG TCGAAATAAC AACCGGGGCG 
GCAAAAGGGC AGGAATTCCA GGTGAAGAAA CTCCTTTTCG GTCTCGTTGA CCGGCTGACC
AGAGCGCAGA AGCGTGGGCT TCTTCTTCTG GCGGATGTGC TCGTGGCGCC CCTTGCGCTG
CTGATCACCG GCATCTTCAT CCGCGCCCCC GGAGGCGAGC ACGAGTGGCT TCTCTTCCCC
GGCGCGGCGC TCTTTGCCTT CGGTCTCTCG CTGCTTTTCG GGATGCCCCG GATCAAGCTC
AACGCCTATG AGACCATGGC CATCCTGAAG ACCGGCGCCT TCGCTGCGGT GCTGACCCTC
GTGCTCTCGA TGCTCGCGTC GGTGGTGGGC ACCGCGGTGC CCGCGGCGGC GGCGATCCTC
TTCGGCCTTC TGTTCTTCAT CCTGTCGGTC GGCGCCCGGA TGGTCATGCT GCATGCGCTG
CTCTGGGTGC TGCAGATCGG CCAGAAGGGG TGCCGCGTGC TGATCTACGG CGCGGGCAAC
ACCGGCACGC AGCTCGCGGC GGCGCTCCGC TCGCGCGGAA CGATCCGGCC CATCGCCTTC
GTCGACGACA ATCCCGCGCT GCAGGCGATG GTGATCGCGG GGCTCCGCGT CTATCCGTCG
GACCGGATCG AGCGGCTGGT GCGGGAGCGC GACGTGTCCC GTGTGCTTCT TGCCATGCCC
TCGGAATCGC CCGCCAAACT TGCCCGGATC GCCCACCGGC TGCAGCTCGC GGGTGTCGAT
GTTCACACCG TGCCCTCCTT CGCGCAGCTC GTGGGCGAGG AGCAGCTGGT CGACAACCTG
TCTCCCTTCA CCTTCGGCCG TTTCCTCGGC CGCCAGCAGA TCGAGGATGC GCTGCCACAG
GGGGCCGATG CCTATGTCGG CCGCACGGTG CTGGTCTCGG GCGCGGGCGG CTCGGTCGGA
TCCGAGCTCT GCCGCCAGCT GCTGCTGATC CGTCCCCGGC GCATCGTCCT GTTCGAGATC
AGCGAGATCG CCCTCTACAC CATCGACCGC GAGCTGCAGG CGATGGCCGA AGGCACCGGG
GTCGAGATCG TGCCGGTCCT CGGATCGGTC ACCGATTCGC GGCTGTCGCG GATGGTGATG
CAGGATCACG GGGTCGAGGT GGTGTTCCAT GCAGCCGCCT ACAAGCATGT GCCGCTGGTC
GAGCACAATC CGATCGCGGG TCTGGCCAAC AATGTGCTGG GCACCCGGAC GCTGGCGGAT
GCCGCGCACG AGGCCGGCGT GGCGCGCTTC ATCCTGATCT CGACAGACAA GGCGGTGCGC
CCGACGAATG TCATGGGCGC CTCGAAGCGG CTGGCCGAGC TGGTGATTCA GGATCTCGCG
AAGCGGTCGA AGAAAACGAT CTTTTCGATG GTGCGGTTCG GCAACGTTCT CGGCTCGTCG
GGCTCGGTCA TCCCGCTCTT CAAGGAGCAG ATCGCCCGCG GCGGACCGGT CACGCTGACG
CACGAGGATG TCACCCGTTT CTTCATGACC ATCTCGGAAG CGGCACGGCT GGTGCTGCTG
GCGGGCTCCT TCGCCGATCC GGGCGATTGC CGTGGCGGCG ATGTGTTCGT GCTCGACATG
GGCAAGCCCG TGCGCATCCG CGATCTCGCC GTGCAGATGA TCGAGGCGGC CGGCAAGTCG
GTGCGCGATG AGCGCAACCC CTTCGGGGAC ATCGAGATTG TGGTCACGGG TCTGCGGCCC
GGCGAGAAGC TGCACGAGGA GCTGCTGATC GGCGAGGGGC TGCTGACCAC GCCGCACTCG
AAGATCCTGC GCGCTCAGGA GGAGAGCCTG TCCGAGCTCG AGATGGCCAC CGCGCTGCGG
GCGCTGCGCA GTGCCATGGC GGCCGGAGAC CCGCAGGCGG CCCGCCGGGT CATACTCTCC
TGGGTTGAGG GATATCGCCC TCCGGAAATC GTTGCCGCCG GGCGATAG
 
Protein sequence
MVRRQVGHGA RQTVEITTGA AKGQEFQVKK LLFGLVDRLT RAQKRGLLLL ADVLVAPLAL 
LITGIFIRAP GGEHEWLLFP GAALFAFGLS LLFGMPRIKL NAYETMAILK TGAFAAVLTL
VLSMLASVVG TAVPAAAAIL FGLLFFILSV GARMVMLHAL LWVLQIGQKG CRVLIYGAGN
TGTQLAAALR SRGTIRPIAF VDDNPALQAM VIAGLRVYPS DRIERLVRER DVSRVLLAMP
SESPAKLARI AHRLQLAGVD VHTVPSFAQL VGEEQLVDNL SPFTFGRFLG RQQIEDALPQ
GADAYVGRTV LVSGAGGSVG SELCRQLLLI RPRRIVLFEI SEIALYTIDR ELQAMAEGTG
VEIVPVLGSV TDSRLSRMVM QDHGVEVVFH AAAYKHVPLV EHNPIAGLAN NVLGTRTLAD
AAHEAGVARF ILISTDKAVR PTNVMGASKR LAELVIQDLA KRSKKTIFSM VRFGNVLGSS
GSVIPLFKEQ IARGGPVTLT HEDVTRFFMT ISEAARLVLL AGSFADPGDC RGGDVFVLDM
GKPVRIRDLA VQMIEAAGKS VRDERNPFGD IEIVVTGLRP GEKLHEELLI GEGLLTTPHS
KILRAQEESL SELEMATALR ALRSAMAAGD PQAARRVILS WVEGYRPPEI VAAGR