Gene Rsph17029_1894 details


Gene Information

Locus tag: Rsph17029_1894
Symbol:
ID: 4897469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2006963
End bp: 2008363
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 69%
IMG OID: 640112488
Product: TolC family type I secretion outer membrane protein
Protein accession: YP_001043770
Protein GI: 126462656
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID: [TIGR01844] type I secretion outer membrane protein, TolC family
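
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive coordinates
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record: Start bp 2006963, End bp 2008363
print(cds_lengths(2006963, 2008363))  # (1401, 466)
```

Under these assumptions the result agrees with the listed Gene Length (1401 bp) and Protein Length (466 aa).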


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.170185
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.445756
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAGA TCAGACTGTG GGCCGTGGCG ACCTGCTCGG CTCTGGCGGT GATGGCCTCG 
TCCGCGGCAC AGGCGGAGAC GCTCGCCGAC GCCCTCATCT CCGCCTACCG CAACAGCAAC
CTGCTCGAAC AGAATCGCGC GCTTCTGCGG GCGACCGACG AGGATGTGGC GGTGGCCGTC
GCGGCTCTGC GCCCCGTCGT GCAGTTCGTC GCGCAATCGA CCTACAGCTT CCAGCGGGTT
CATGCGGATG CGACGCTCCT GACCCCTGCC GGGCGGACCA ATGTCGAGAA CCTGAACTCC
TCGGTCGGGC TCACCGCCTC GATGACGCTC TATGACTTCG GGCGCAATGC GCTCGCCGTC
GAGGCCGCCA AGGAGACGGT GCTCGCCACC CGCGAGGCGC TGGTGCAGGT CGAGCAGAAC
GTGCTGCTCG ATGCGGTCAA TGCCTATGTG CAGGTGCAGC TTGCGCAATC CATCGTCAAT
CTGCGGCGCA ACAACCTCGG GCTGATCGAT CAGGAACTGC AGGCGGCGCA GGACCGGTTC
GACGTGGGCG AAGTCACCCG CACCGACGTG TCGCAGGCGC AGGCCGCGCT GGCGGCCTCG
CGGTCCGACC TGACCTCGGC CGAGGGCGAT CTCAAGGTGG CGCGCGAGGC CTACAAGGCC
GCCGTGGGCC ATTATCCGGT CGATCTCGCG CCGCGTCCTG CCGCGCCGCG CACCGCCGCG
ACCATGGAGG CGGCGCGTCA GGTGGCGCTC CGCGCCCATC CGCAGGTGCG CCAGGCCCAG
CGTCAGGTGG CGGCGGCCGA CCTGAACGTG GCGCGCGCCA AGGCCGCGAT GCGGCCTTCG
ATCAGCGCCG AGGCGAATGT GGGGCTCGAC GACGAGGGGC AGGAGTCGGC CAGCGTCGGC
CTCTCGCTCC GGCACACGCT TTATGCCGGG GGCGAGCTGT CCGCGCTTTA CCGCCAGACG
CTTGCCAACC GCGATGCGCA GAAGGCGAAC CTGCTGCAGA CCGGCGTGAA TGTGGCGCAG
AATGTGGGCG TCGCCTGGTC GACGGTCGAG GTGGCCTCCG CCGCCATCGC CGCGGGGGAC
GAGGAAGTCC GCGCTGCCCG CACCGCCTTC GAGGGCGTGC GCGAGGAAGC GACGCTCGGT
GCCCGGACCA CGCTCGACGT GCTGAACGCC GAGCAGGACC TCCTGAACTC GCAGGCCGAC
CGTCTCACCG CCGAGGCGCA GCGCTATGTC GGGATCTATC AGGTGCTGGC CTCGATGGGG
CTCCTGACCG TCGAACATCT CAATCTGGGT ATCCCGACCT ACGATCCGGC AGCCTACTAC
AACGCCGTGA AGCACGCGCC GGCCACCAGC TCGCAGGGCA AGCGGCTCGA CCGCGTGCTG
AAGTCGATCG GCCGGAACTG A
 
Protein sequence
MRKIRLWAVA TCSALAVMAS SAAQAETLAD ALISAYRNSN LLEQNRALLR ATDEDVAVAV 
AALRPVVQFV AQSTYSFQRV HADATLLTPA GRTNVENLNS SVGLTASMTL YDFGRNALAV
EAAKETVLAT REALVQVEQN VLLDAVNAYV QVQLAQSIVN LRRNNLGLID QELQAAQDRF
DVGEVTRTDV SQAQAALAAS RSDLTSAEGD LKVAREAYKA AVGHYPVDLA PRPAAPRTAA
TMEAARQVAL RAHPQVRQAQ RQVAAADLNV ARAKAAMRPS ISAEANVGLD DEGQESASVG
LSLRHTLYAG GELSALYRQT LANRDAQKAN LLQTGVNVAQ NVGVAWSTVE VASAAIAAGD
EEVRAARTAF EGVREEATLG ARTTLDVLNA EQDLLNSQAD RLTAEAQRYV GIYQVLASMG
LLTVEHLNLG IPTYDPAAYY NAVKHAPATS SQGKRLDRVL KSIGRN
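
The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a minimal sketch, not the database's own method: the helper names `gc_content` and `translate` are hypothetical, and the codon table is the NCBI amino acid string that translation table 11 shares with the standard code (table 11 differs mainly in which start codons it permits, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout, tables 1 and 11)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence listed above
print(translate("ATGAGAAAGA TCAGACTGTG G"))  # MRKIRLW
```

Passing the full gene sequence to `gc_content` and `translate` gives values that can be compared with the GC content field and the protein sequence in this record.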