Gene Rsph17029_1716 details

Gene Information

Locus tag: Rsph17029_1716
Symbol:
ID: 4897818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1810227
End bp: 1811498
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 70%
IMG OID: 640112309
Product: flagellar basal body FlaE domain-containing protein
Protein accession: YP_001043598
Protein GI: 126462484
COG category: [N] Cell motility
COG ID: [COG1749] Flagellar hook protein FlgE
TIGRFAM ID: [TIGR03506] flagellar hook-basal body proteins
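
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1810227
end_bp = 1811498

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1272 423
```

Under these assumptions the computed values match the listed gene length (1272 bp) and protein length (423 aa).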


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.727886
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.851107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGATCA ACACCGCCCT CTCGGGGCTC TCCGCCGCCC AGCACGACAT CGCCGCCACG 
TCGCACAACA TCGCCAACGT GGGCACCATC GGCTTCCGCG GCAGCCGCGC GGAATTCGCG
GACGTGTTCA ACTCCTCGCC CTACAGCATC TCGCGGACGG CAGTGGGCTC GGGCGTGCAG
ACGCTGCGCA CCGCGATGCA GTTCAGCCAG GGCTCGGTGG TGGCCACGGG CAACACGCTC
GATCTGGCCA TCGAGGGGCA GGGCTTCTTC GCCACCGAGC CCGCCGTCGG GCCCAACTCT
GCCAAGCCGG AGCCGATCTA CACCCGCGCG GGCGCCTTCG GCCTCAATGC AGAGGGCGTG
GCGATCAATG CCTCGGGTCA GAAGCTGCTC GCCTGGCCGG TGAGCGTGGA AGGCGACGCG
TTGAGCCAGG TGCCGGGCTC GGCCACGCCG CTGACCATCC CGCTCACGAT GGGATCGCCG
GTGGGGACGA GCACGGTTGC GCTGTCGGTC GATCTGCCGA CCGACAATGC GATGCTGGGC
ACGCAGGCGG CCGTTCCGCC CGCCGCGGCC TTCGATCCGG CCGATCCAAC CACCTATGCC
GCTGTCACGG CCGTTCCCAT CTTCGATGCG AAGGGCAATG CGGTCGAGGC GGCGGCCTAT
TTCATCAAGA CCGCGAACCC CACCGCGGGC AACCCCCAGA CAGACTGGGC GGTGCGGCTC
GTGGTGGCGG GCGAGCCGCT GACGCCGGCA CAGGGCGACC TCACCTTCGA CGCGACCGGC
GCGCTCTCGG GCGGCACGGG CGCGCTCAGC TTCACGGCGG CTTCGGGCAA CAGCTACACG
CTCGATCTCA CCGAGACCCA GCTCGCCGAC CGGAGCTTCG AGGTCAATAC CGTGAGCCAG
GACGGCAAGA GCGCCTCGGC GCTGACCAGC CTCGAGGTGG ATGCGAGCGG CACGGTCTGG
GCCGCCTACG GTGCGGGCAG CCCGGTCGCC ATGGGGCAGG TGGTGCTCGT GACCTTCGCC
AACCCGCAGG CGCTGCGCCA GCTCGGGGCC TCGGGCTTCG CCGCCACCGC CGATTCCGGT
CAACCCGTCG CGGGCACGGC GGGCGACTCC GGCTTCGGGA TCATCCGGGC CGGCGCGCTG
GAACATGCCA ACGTGGACCT CACCGAGGAG CTCGTCCATC TCATCACCGC GCAGCGCAAC
TATCAGGCCT CGGCCAAGGC GATGGAGACC TCGAACTCGC TCATGCAGAC GATCATGAAC
ATCCGCAGCT GA
 
Protein sequence
MSINTALSGL SAAQHDIAAT SHNIANVGTI GFRGSRAEFA DVFNSSPYSI SRTAVGSGVQ 
TLRTAMQFSQ GSVVATGNTL DLAIEGQGFF ATEPAVGPNS AKPEPIYTRA GAFGLNAEGV
AINASGQKLL AWPVSVEGDA LSQVPGSATP LTIPLTMGSP VGTSTVALSV DLPTDNAMLG
TQAAVPPAAA FDPADPTTYA AVTAVPIFDA KGNAVEAAAY FIKTANPTAG NPQTDWAVRL
VVAGEPLTPA QGDLTFDATG ALSGGTGALS FTAASGNSYT LDLTETQLAD RSFEVNTVSQ
DGKSASALTS LEVDASGTVW AAYGAGSPVA MGQVVLVTFA NPQALRQLGA SGFAATADSG
QPVAGTAGDS GFGIIRAGAL EHANVDLTEE LVHLITAQRN YQASAKAMET SNSLMQTIMN
IRS
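
The gene and protein sequences above are displayed in space-separated blocks of ten characters, so they need light cleanup before any computation. The sketch below shows one way to strip the block formatting, compute a GC fraction, and translate in frame 1. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the input is assumed to contain only A, C, G and T, and the codon mapping relies on translation table 11 assigning the same amino acids as the standard code. Alternative start codons, which table 11 permits, are not handled.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Assumption: table 11 assigns the same amino acids as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (input must be non-empty)."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above, exactly as displayed.
fragment = clean("ATGTCGATCA ACACCGCCCT CTCGGGGCTC")
print(translate(fragment))  # prints: MSINTALSGL
print(round(gc_fraction(fragment), 2))
```

The translated fragment corresponds to the first ten residues of the protein sequence above. Passing the full cleaned gene sequence to the same functions would allow the listed GC content and protein sequence to be compared against the nucleotide data.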