Gene Rsph17029_1812 details


Gene Information

Locus tag: Rsph17029_1812
Symbol:
ID: 4896912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1911735
End bp: 1912715
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 66%
IMG OID: 640112406
Product: ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components TauA
Protein accession: YP_001043691
Protein GI: 126462577
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
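
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1911735, 1912715)
print(length, protein_length(length))  # prints: 981 326
```

Both values match the Gene Length and Protein Length fields of this record.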


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.661868
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.196706
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGAC TGCTGACCGG AGCCGCGCTT CTCGCGCTCG CCGCCGGCAC GGCTTCGGCC 
GAGGAGGTGA CGCTCCAGCT CAAATGGGTG ACGCAGGCCC AGTTCGCGGG CTACTACGTC
GCGCTCGATC AGGGCTTCTA CGAGGAGGAG GGGCTCGAGG TCACGATCAA GCCGGGCGGC
CCCGACGTGG CGCCGGTGCA GGTGCTGCTC GGCGGCGGCG CCGACGTGAT GGTCGACTGG
CTGCCCTCGG CGCTGGCCGC GCGCGAGCAG GGGGCCGACA TCGTCAACAT CGCCCAGCCC
TTCAAGAGCT CGGGCATGAT GCTGACCTGC CTGAAGGAAT CGGGCGTTTC GGGCCCCGAG
GACTTCAAGG GCAAGACGCT GGGCGTCTGG TTCGGCGGCA ACGAATATCC CTTCCTCAAC
TGGATGTCCA AGCTGGGCCT GCCCACCGAC GGCTCGCCTC AGGGGGTGAC GGTGCTCAAG
CAGGGCTTCA ACGTCGATCC GCTGCTGCAG AAGCAGGCGG CCTGCATCTC GACCATGACC
TACAACGAAT ATTGGCAGGT GATCGACGCG GGCCTCTCGC CGGACGACCT CGTGACCTTC
AAATACGAGG ATCAGGGCGT GGCGACCCTC GAGGACGGTC TCTATGTGAT GGCCGACAAG
CTGAAGGATC CGGCCTTCGT CGAGACCATG GCCAAGTTCG TGCGGGCCTC GATGAAGGGC
TGGAAATGGG CCGAGGAGAA CCCCGACGAC GCGGCCATGA TCGTGCTCGA CAATGACGAC
ACGGGCGCGC AGACCGAGAG CCACCAGAAG CGGATGATGG GCGAGGTGGC TAAGCTGACC
GCGGGCTCGG ACGGCACGCT CGACGAGGCG GATTACAAGC GCACCGTGGC GACCCTGATG
GGCGGCGGCT CGGATCCGGT GATCTCGAAG GAGCCTGAGG GCGCCTGGAC CCACGAGGTC
ACCGACAAGG CGCTGAAGTA A
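
The gene sequence is displayed in blocks of 10 bases, so the whitespace has to be removed before the sequence is used elsewhere. The sketch below, assuming plain Python with no external libraries, strips the blocking and computes a GC fraction that can be compared against the GC content field of the record (66%). The helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display row of the gene sequence, copied from the record.
first_row = "ATGAAAGGAC TGCTGACCGG AGCCGCGCTT CTCGCGCTCG CCGCCGGCAC GGCTTCGGCC"
print(len(clean_sequence(first_row)), round(gc_fraction(first_row), 3))
```

Passing the full gene sequence to gc_fraction gives the value for the whole gene; the example uses only the first row to stay short.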
 
Protein sequence
MKGLLTGAAL LALAAGTASA EEVTLQLKWV TQAQFAGYYV ALDQGFYEEE GLEVTIKPGG 
PDVAPVQVLL GGGADVMVDW LPSALAAREQ GADIVNIAQP FKSSGMMLTC LKESGVSGPE
DFKGKTLGVW FGGNEYPFLN WMSKLGLPTD GSPQGVTVLK QGFNVDPLLQ KQAACISTMT
YNEYWQVIDA GLSPDDLVTF KYEDQGVATL EDGLYVMADK LKDPAFVETM AKFVRASMKG
WKWAEENPDD AAMIVLDNDD TGAQTESHQK RMMGEVAKLT AGSDGTLDEA DYKRTVATLM
GGGSDPVISK EPEGAWTHEV TDKALK