Gene Rsph17029_1705 details


Gene Information

Locus tag: Rsph17029_1705
Symbol:
ID: 4897383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1795631
End bp: 1797112
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 66%
IMG OID: 640112298
Product: flagellin domain-containing protein
Protein accession: YP_001043587
Protein GI: 126462473
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
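
The length fields above follow from the coordinates by simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of three and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(1795631, 1797112)
print(length)                     # 1482
print(protein_length_aa(length))  # 493
```

Both results agree with the Gene Length and Protein Length fields listed in the record.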


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000174414
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGACGA TCAACACGAA CATCGGTGCC ATCGCGGCTC AGGCCAACAT GACCAAGGTG 
AACGACCAGT TCAACACGGC CATGACGCGC CTCTCCACCG GCCTGCGCAT CAACGCGGCC
AAGGATGACG CGGCCGGCAT GGCCATCGGC GAGAAGATGA CCGCGCAGGT CATGGGTCTC
AATCAGGCGA TCCGGAACGC GCAGGATGGC AAGAACCTCG TGGACACGAC CGAAGGCGCG
CACGTCGAGG TTTCCTCGAT GCTCCAGCGT CTGCGCGAAC TGGCCGTTCA GTCCTCGAAC
GACACCAACA CCGCCGCCGA CCGCGGGTCG CTCGCGGCCG AAGGCAAGCA GCTGATCGCC
GAGATCAACC GCGTGGCGGA ATCCACGACC TTCAACGGCA TGAAGGTGCT CGACGGCTCC
TTCACCGGCA AGCAGCTCCA GATCGGCGCC GACTCGGGCC AGACCATGGC GATCAACGTG
GACAGCGCCG CGGCCACCGA CATCGGCGCC CACAAGATCT CCAGCGCCTC GACCGTCGTG
GCCGACGCGG CCCTCACCGA CACGACGATC GCCGCCTCGA CCGACATCAC GATCACGGGC
TTCGCCGGCA GCGACAAGAT CACGACCGCG GCGGGCGACT CCGCGCGCAC GCTGGCCGAA
TCCATCAACA AGAAGACCTC GAGCACCGGC GTCGAGGCCA CGGCCACGAC CAAGGCGCAG
CTGTCGGGCT TCACCAAGGG CGACACGGTG AGCTTCAAGA TCGGCACCGC CGATGGCAAC
GAGGTCTCGA TCGGGGACGT GAGCATCACC GACGCGTCCG ACGTCCGCGG CCTGCGCGAC
GCGATCAACG CGGTCTCGGG TCAGACCGGC ATCACCGCCG CCATGGCCAA GGACGACAAC
AGCAAGATCG TTCTCACCGA CGCGAACGGC GATGACATCA TGCTGACGAG CGTCTCCTCC
ACCACGGCCG ACTTCAAGGT CACCGCGCTG AAATCCGATG GCACCGCCAC CGCCACCAAC
GTGGACATCG GCTTCGGCAC GAACAAGAGC GCCGGCGTGA CCGGGCAGGT GGACCTCGTC
TCGACCAAGT CCTTCTCGGT TGCGGCCTCG GTCTCGGGCA GCGCCACCGC CCACTTCGCC
AACGCCAACG AGGGTTCGGA ACTCAGCTCG GTGGCCGAGA TCGACCTGTC CACGGCCGAA
GGTGCGTCGG CCGCCATCGG TGTGATCGAC GTGGCGCTCT CGAAGATCAG CCAGTCGCGC
TCGGAACTGG GTGCGGTCTC GAACCGCCTC GACTCGACGA TCTCGAACCT GACCAACATC
TCGACCAGCG TGCAGGCTGC CAAGTCGCAG GTGATGGACG CCGACTTCGC GGCGGAATCG
ACGAACCTCG CCCGCTCGCA GATCCTGAGC CAGGCCTCGA CGGCGATGCT GGCGCAGGCG
AACTCCTCGA AGCAGAACGT CCTGAGTCTG CTCCGCGGCT GA
 
Protein sequence
MTTINTNIGA IAAQANMTKV NDQFNTAMTR LSTGLRINAA KDDAAGMAIG EKMTAQVMGL 
NQAIRNAQDG KNLVDTTEGA HVEVSSMLQR LRELAVQSSN DTNTAADRGS LAAEGKQLIA
EINRVAESTT FNGMKVLDGS FTGKQLQIGA DSGQTMAINV DSAAATDIGA HKISSASTVV
ADAALTDTTI AASTDITITG FAGSDKITTA AGDSARTLAE SINKKTSSTG VEATATTKAQ
LSGFTKGDTV SFKIGTADGN EVSIGDVSIT DASDVRGLRD AINAVSGQTG ITAAMAKDDN
SKIVLTDANG DDIMLTSVSS TTADFKVTAL KSDGTATATN VDIGFGTNKS AGVTGQVDLV
STKSFSVAAS VSGSATAHFA NANEGSELSS VAEIDLSTAE GASAAIGVID VALSKISQSR
SELGAVSNRL DSTISNLTNI STSVQAAKSQ VMDADFAAES TNLARSQILS QASTAMLAQA
NSSKQNVLSL LRG
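
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following sketch shows one possible way to strip that layout and compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the code assumes the input contains only nucleotide letters and whitespace.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACGACGA TCAACACGAA CATCGGTGCC ATCGCGGCTC AGGCCAACAT GACCAAGGTG"
print(len(clean_sequence(first_line)))  # 60
print(gc_fraction(first_line))
```

Passing the full gene sequence text to `gc_fraction` should give a value comparable to the GC content field of the record, assuming that field was computed over the same bases.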