Gene Rsph17029_1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1709 
Symbol 
ID4897602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1801096 
End bp1802325 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content72% 
IMG OID640112302 
Productflagellar hook-associated protein 3 
Protein accessionYP_001043591 
Protein GI126462477 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID[TIGR02550] flagellar hook-associated protein 3 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.718507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0295245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTCG GAACCACCCT CTTCGCCACG CTTGCCAGCC GCAACTTCTC GCGCCTTCAG 
ACCGAGATCG GCACATTGCA GGCCCGTATC GCCTCGGAGG TGCGCGACCC GCGCCCGTCC
GCCGATCCCA CCCGCGCGGT GCAGCTCTCG GCCGCGAAGG AGATGGAAGC CACGCTCGGG
CGCTACGGCG CGAATGCGAG CTCGGCCGCG GATCGGCTGG CCCATGCCGA CGTGGCGCTG
GGCGAGGTTG CGGCGCGGAT GCGCGAGCTC AAGGATGTCG TGCTGCAGGC AGGAAACCCT
ACGCTCTCCG ACGAGGGACG GGCCGGTCTG CGGATCGTGG CCGAGTCGGC CCGCGAGGCG
CTCTTCGCGC TGGCCAACCG CAAGGATGCG ATGGGGCAGG GCCTCTTTGC GGGCTATGCC
GCGGGTCCGG CCTTCGTGAA GGAGGGCGAT ACGGTCCGGT TCGCGGGCAA CGACGGGCAG
CCCGTGGCCC AGCTCTCCGA GACGCTGCGC GTGGCCACCA GCCTCGGCGG CGCCGAGGTC
TTCATGGCGG TGCCGACCGA GGCGGGGCCG CGCAGCGTCT TCGATCTGGC CGACGATCTG
GTGGCGACCC TGTCGGTGCC GCTGGCACAT TCCAGCCCGC AGCGCAGCGC GGAAGACGCG
GCGCGGCTTT CTCTGGCGGC CCCTCCGGCC CCGGCCACGG TGCGCTTCAC CCTGACGGGC
CCGGTGGGCT CGGTCGAGAT CGAGCAGCGG CTGCCGGGCT CGGCCCTTGC CGCCATCAAT
GCGGCCTCGG CCCAGACCGG CGTCACCGCC ACGGAGGAAC CGGACGGGAC CCTCGTCCTG
GGCGCCGTGG GCCGCATCAC GGTCTCGGAC ATGAGCCGCA GCGACGACCC GCGCGACGTG
CTGGCAACCT TCACCAGCGC GGATGACAAG GGCGGCTGGA TCATGCCCGC GCGGTTCGAC
GCGGCCTCGC TGACCGACGC TTTCGATGCC GCCGTGAGCC ACATGGCCGA GCAGCGGGCC
CGCGCCGGCG CGCTCGCCGC CTCTGTCGAC ACGCAGGCGG ATGCGATCAG GACCCGGCAG
ACCCGAATCG CCACTGCCAT CGGCGGGCTC GAGGATCTCG ACGTGGCCGA GGCGGTCACG
CGGCTGCAGC AGCTTCTCCT GACGCAGGAG GCGGCGCAGC AGACCTATGT CAAGATCGCC
AACCGCAGCC TGTTCGATTA CCTGCGCTAG
 
Protein sequence
MTLGTTLFAT LASRNFSRLQ TEIGTLQARI ASEVRDPRPS ADPTRAVQLS AAKEMEATLG 
RYGANASSAA DRLAHADVAL GEVAARMREL KDVVLQAGNP TLSDEGRAGL RIVAESAREA
LFALANRKDA MGQGLFAGYA AGPAFVKEGD TVRFAGNDGQ PVAQLSETLR VATSLGGAEV
FMAVPTEAGP RSVFDLADDL VATLSVPLAH SSPQRSAEDA ARLSLAAPPA PATVRFTLTG
PVGSVEIEQR LPGSALAAIN AASAQTGVTA TEEPDGTLVL GAVGRITVSD MSRSDDPRDV
LATFTSADDK GGWIMPARFD AASLTDAFDA AVSHMAEQRA RAGALAASVD TQADAIRTRQ
TRIATAIGGL EDLDVAEAVT RLQQLLLTQE AAQQTYVKIA NRSLFDYLR