Gene Rsph17029_3906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3906 
Symbol 
ID4898737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1041469 
End bp1042500 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content70% 
IMG OID640114509 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001045756 
Protein GI126464643 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCA CGACCAGACT GAGCCGTCGC AGCTTCGGCC TTCTGACCGC CGGAAGCGTC 
GCGAGCCTTG CCATCGGGGC GCCGAGCCTG ATCCGCGCCC AGACGGCGGT GAATTTTGCC
GTCCCCAACC CCTCGGCCCT GACCTGGCTG CCCTACTGGG TGGCGGTGGG CGAAGGCTAC
TTCGCCGAAG AAGGCTTCGA GCCCCGGCTC GAGGCCATCG ACGGCTCTTC GGCCGTGCTT
CAGGCCATGT CGGCGGGACA GGCACAGATC GGCGCGCCGG GACCGGGCCC GACGCTCGGC
GCGCGCGCGC GCGGGGTGGA CGTCAAGTTC CTCTACAACC TCTATCCGAA GTCGGTCTTC
GGCCTGCTCG TGAAGGAGGA CAGCGCCTAT CAGACCCCGG CCGACCTCAA GGGCCAGGTC
ATCGGCGTGG GCACCGCGGA CGGGGCCGAG GTCTCCTTCA CCCGCGCCAT CCTGACCGAG
GCCGGCATGA CCGAGGGGGC CGATTACAGC TTCCTGCCGG TGGGCGACGG CGGCACGGCG
GCGGTGGCCT TCCTGCGCGA CGAGGTGGCG GCCTATGCGG GCGCGGTCTC GGATGCGGCG
ATCCTTGCCG CGCGCGGCCT CACGCTGCGC GAGATCACGC CCGAGGCCTT CCTCGGCTTC
TTCGGCAACG GCATCGCCAT GCTGGAAAGC CAGATGCAGG CCATGCCCGA GCTTGCCCCC
GCTTTCGGCC GGGCGCTGGT GCGCGGCACG CGCTTCGCCT CGGATCCGGC CAACAAGGAG
AAGGCACTGG CCCATTGCGC GGCCGGCAAC CCGCAGGAGG GCGAGCAGGA TTACGCGGCC
TCGCTCTATG ACGGCGTGGT CAACCGCATG ACCCCGACCG AGGCCTTCAT CGGCAAGGGC
TACGGCTACC AGCCGCCCGA GCACTGGCAG GCGATCCACG ATTCCGCCGT GGCTTCGGGC
GCCCTGTCCG AGCCGATCGA GGATCTGGCC TCGGTCTATA CCAACGAGTT CGTCGAAGGC
TGGAACAGCT GA
 
Protein sequence
MTSTTRLSRR SFGLLTAGSV ASLAIGAPSL IRAQTAVNFA VPNPSALTWL PYWVAVGEGY 
FAEEGFEPRL EAIDGSSAVL QAMSAGQAQI GAPGPGPTLG ARARGVDVKF LYNLYPKSVF
GLLVKEDSAY QTPADLKGQV IGVGTADGAE VSFTRAILTE AGMTEGADYS FLPVGDGGTA
AVAFLRDEVA AYAGAVSDAA ILAARGLTLR EITPEAFLGF FGNGIAMLES QMQAMPELAP
AFGRALVRGT RFASDPANKE KALAHCAAGN PQEGEQDYAA SLYDGVVNRM TPTEAFIGKG
YGYQPPEHWQ AIHDSAVASG ALSEPIEDLA SVYTNEFVEG WNS