Gene Rsph17025_1662 details

Gene Information

Locus tag: Rsph17025_1662
Symbol:
ID: 5082740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 1704687
End bp: 1705916
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 71%
IMG OID: 640483220
Product: flagellin-like protein
Protein accession: YP_001167860
Protein GI: 146277701
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
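
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration and not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name feature_lengths is hypothetical.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = feature_lengths(1704687, 1705916)
print(gene_len, protein_len)  # 1230 409
```

Under those assumptions the result agrees with the listed Gene Length (1230 bp) and Protein Length (409 aa).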


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.741191
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCCTTG GAACCACCCT CTTCGCAACC CTCGCCAGCC GGAACTTTGC CCGGATCCAG 
ACCGAGATCG GCGGCTTGCA GGAACGGATC TCGGCCGCAA CGCAGGATCC GCGGCCCTCG
GCCGATCCCG CCCGCGCCCT GCAACTGTCG GCCGCGCGCG AGGTGCAGGA CGCGCTCTCG
CGCTTTTCCG TCAATGCCGG GACGGCGGCC GAGCGGCTGG CCCATCTGGA CGTGACGCTG
GGCGATGTCG CCAGCCATGT GCGCGACCTG AAGGACATCG TCCTCCAGAT GGGCAATGCC
AGCCTGACGG ACGAGGGCCG GGCGGGCCTG CGCATCGAGG CCGAGGCGCT GCGCGAGGCG
ATGCTGGCCG CGGCCAACCG CAAGGACGGA ACGGGGCAGG GGCTGTTCTC GGGCTATGCC
ACGGGCGCGG CCTTCGAAAA GACTGCAACG GGCGTGCGGT TTGCGGGCAA CGCGGGCCAG
CCGGTCGCGC AGCTGTCCGA GAGCCTGCGC GTGGCCACGA GCCTCGGAGG GAACGAGGTC
TTCATGACCG TCGAGACCGA AGGAGGCGTC CGCAGCCTGT TCGATCTCGC CGACGATCTG
GTGGCGGCCC TCTCGCCGCC CATCAGCAAG GCCACCACGT CCCGCACCTC GGTCGGCACG
GCGAGTCTCT CGATCGAGCC GGTTCAGGGC GAGGCCACGC TGCGCTTCAC CCTGACCGGG
CCCGGCGGAT CGGCCGAGAT CGAGCAGCGT CTGCCCGGTT CGGTCGAGGA GGCGATCAAC
GCCGCCGCGG CGACGACCGG CATCACCGCC ACGACGGCTG CGGACGGCTC GCTGCGGCTG
GCGTCGCTGG GCACGATCGA GCTGTCGGGC ATGAGCCGGA GCGACGGGGC GCGCGAGGTG
CTGGCGACCC TCACGGATGA ACGGGGCCGC GAGGGCTGGG TGGTGGACAA GCGGTTCGGC
GCCTCGCCCA TGACGGCCGC CTTCGACGCC GCCATCGGCC ACATGGCCGA GCAGCGGGCG
CGGGCGGGGT CTCTGGCCGC GAGCGTGGAC AGCCAGATGG AGGCGATCAA GGGGCGCCAG
ACGCGGATGA CGCAGACGGT GGCCGGGCTC GAGGATCTGG ATGTGGCGGC GGCGGTGACA
CGGCTTCAGG CGCTTCTGCT GACGCAGGAG GCGGCGCAGC AGACCTATGT GAAGATCGCC
AGCCGAAGCC TCTTCGACTA TCTGCGCTAG
 
Protein sequence
MTLGTTLFAT LASRNFARIQ TEIGGLQERI SAATQDPRPS ADPARALQLS AAREVQDALS 
RFSVNAGTAA ERLAHLDVTL GDVASHVRDL KDIVLQMGNA SLTDEGRAGL RIEAEALREA
MLAAANRKDG TGQGLFSGYA TGAAFEKTAT GVRFAGNAGQ PVAQLSESLR VATSLGGNEV
FMTVETEGGV RSLFDLADDL VAALSPPISK ATTSRTSVGT ASLSIEPVQG EATLRFTLTG
PGGSAEIEQR LPGSVEEAIN AAAATTGITA TTAADGSLRL ASLGTIELSG MSRSDGAREV
LATLTDERGR EGWVVDKRFG ASPMTAAFDA AIGHMAEQRA RAGSLAASVD SQMEAIKGRQ
TRMTQTVAGL EDLDVAAAVT RLQALLLTQE AAQQTYVKIA SRSLFDYLR
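
The protein sequence corresponds to the gene sequence translated with translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs in its permitted start codons. The following is a minimal sketch of that translation, not the tool the database used: the helper names (clean, translate, gc_fraction) are hypothetical, the input is assumed to be the coding strand written 5' to 3', and alternative start codons are not handled specially because this gene begins with ATG.

```python
from itertools import product

# Codon table in the usual TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the blanks and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence as shown in this record.
first_line = "ATGACCCTTG GAACCACCCT CTTCGCAACC CTCGCCAGCC GGAACTTTGC CCGGATCCAG"
print(translate(first_line))  # MTLGTTLFATLASRNFARIQ
```

Passing the full gene sequence to translate and gc_fraction in the same way allows the Protein sequence and GC content fields to be compared against the nucleotide data.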