Gene Rsph17025_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4037 
Symbol 
ID5086210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp72514 
End bp74163 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content71% 
IMG OID640485600 
Producthypothetical protein 
Protein accessionYP_001170194 
Protein GI146280037 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.677701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGAGC CCCACGTGCT GATCCTGGGC TCGGGTCCCG GCGGGGCGGC GCTCGCCTGG 
CGGCTCGCCT CGGCTGGGCT TGCGGTGCGG GTGCTCGAGG CGGGGCCCGC CTTCGATCCG
GCCACCGATT ATGCCCAGGA CCGCGCAGAC TGGGAGACTC CCTTCCCCGA GCGGCCCGGC
AGCCGCGGCG CCTGCGAGAC GGGACCCCTG CAGGAGCTTG GGTTCGAAAT CGACGACATC
CGCTCATGGA ACGCGCTGAC CGGGCCTTAC GTCCCGGGGA CGCGGCGGGC TGACTTCGGC
TATCACCATG TACGGGGCGT AGGCGGCAGC TCGCTTCACT TTACCGGCGA GGCGCACCGG
CTGCATCCCC GTGCATTCAC GATGAAGAGC ACGTTCGGCG TGGCGGCGGA CTGGCCCGTG
ACCTATGCCG AACTCGAGCC CTACTGGCTC GAGGCCGAGC GGCAGTCGGG TGTGGCGGGA
CCGGCCGAGG ATGCACAGCG GCCGCGGAGC GCGCCCTATC CGCTGCCCGC GCATCCCTTC
AGCCATGCCA GCGACCGCCT CGCCCGCGCC GCGCGGAGCC TGGGCCTCTC GGTGCAGGCC
AATGCGCTGG CCGTGCCATC GCGCCCTTAT GACGACCGGC CCGACTGCAA CTATTGCGGC
GGCTGCCTGC GCGGTTGCCA GCGGGGCGAC AAGGGCAGCG TGGACCAGAC CTACCTGCGC
AAGGCCGTAG AGACCGGGCG CTGCGAAGTG CTGCCGGGGA TTGAGGCGAT GCGGCTCGAG
ACGGCGGGGG GACGGGTGAG CGGCGTCCTC TGCGCGACGT CGGCGGGTCC GCGCCTCTTC
CGTGCGCCGG TTGTGATCCT GGCCTGCGGC GCGGTGCAGA CGCCGCGGCT CCTGCTGAAC
TCGGCCTCGG AGGAGAGCCC CGACGGGCTC TGCAACGAGA GCGGCGAGGT GGGGCACAAC
TTCATGGAAA CACTCATATT TACCGCAAGC GCGCTTCATT CCGAGCCCTT GGGCAGCCAC
CGCGGCCTGC CCGTCGACTG GATCTGCTGG GACTTCAATG CACCCGACGC GATCCCGGGC
GTCACGGGCG GCTGCCGCTT CGGCTGCTCG ATGGCCGAGA GCGATCTGGT GGGCCCCGTA
GCCTATGCGA CCCGGGTGGT CGGGGGCTGG GGCCGTGCCC ACAAGCGCGC GCTGCGCGCC
AGTTTCGGGC GCGCGCTGTC GGTCACCGGG ATCGGCGAGT GCCTGCCCCA TCCCGAAAGC
CGGATCCGCC TCTCGACGCG ACGTGACGCG CATGGGATGC CGATCCCGCG GATCGAGAGC
CGCCTCGGGC CCGACGCCTT TGCTCGGCTG CGCTTCATGG CCCGGACCTG CCGGGCTATC
CTTGCCGCCG CAGGCTGCGC CGCGCCCTTC GAGGAATTCA GCTCGGCTGA CGCCTTTTCC
TCGACCCATG TCTTCGGCAC CTGCCGCATG GGCCATGATC CCATGCGGAA CGTTGTGGAC
GGATGGGGCC GCAGCCACCG CTGGCCGAAC CTCTTCGTCG CCGACGCAAG CCTCTTTCCC
TCAAGCGGCG GCGGCGAGTC TCCCGGTCTC ACGATCCAGG CACTGGCACT GCGGACGGCC
GACCATCTGC TGTCGGAAGC CCGTCCATGA
 
Protein sequence
MTEPHVLILG SGPGGAALAW RLASAGLAVR VLEAGPAFDP ATDYAQDRAD WETPFPERPG 
SRGACETGPL QELGFEIDDI RSWNALTGPY VPGTRRADFG YHHVRGVGGS SLHFTGEAHR
LHPRAFTMKS TFGVAADWPV TYAELEPYWL EAERQSGVAG PAEDAQRPRS APYPLPAHPF
SHASDRLARA ARSLGLSVQA NALAVPSRPY DDRPDCNYCG GCLRGCQRGD KGSVDQTYLR
KAVETGRCEV LPGIEAMRLE TAGGRVSGVL CATSAGPRLF RAPVVILACG AVQTPRLLLN
SASEESPDGL CNESGEVGHN FMETLIFTAS ALHSEPLGSH RGLPVDWICW DFNAPDAIPG
VTGGCRFGCS MAESDLVGPV AYATRVVGGW GRAHKRALRA SFGRALSVTG IGECLPHPES
RIRLSTRRDA HGMPIPRIES RLGPDAFARL RFMARTCRAI LAAAGCAAPF EEFSSADAFS
STHVFGTCRM GHDPMRNVVD GWGRSHRWPN LFVADASLFP SSGGGESPGL TIQALALRTA
DHLLSEARP