Gene Rsph17025_1669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1669 
Symbol 
ID5082747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1713813 
End bp1715084 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content70% 
IMG OID640483227 
Productflagellar basal body FlaE domain-containing protein 
Protein accessionYP_001167867 
Protein GI146277708 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.456133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCA ACACCGCCCT TTCGGGTCTC TCCGCCGCCC AGCACGATAT TGCCGCCACC 
TCGCACAACA TCGCGAACGT GGGCACCATC GGCTTCCGCG GCAGCCGCGC CGAATTTGCC
GATGTGTTCA ACTCGTCGCC CTACAGCATC GCCCGGACGG CGGTGGGATC GGGCGTGCAG
ACCCTGCGCA CCGCGATGCA GTTCAGCCAG GGCTCGGTCG TGGCCACGGG CAACACGCTC
GACCTCGCCA TCGAGGGGCA GGGCTTCTTT GCCACCGAGC CCGCCGTGGG CCCGAACTCG
GCCAAGCCCG AGCCGATCTA CACGCGCGCG GGCGCCTTCG GGCTGAACGA CAAGGGGGTG
GCGGTCAATG CCTCGGGGCA GAAGCTGCTC GCCTGGCCCG TGAGCGTCGA GGGCGACGCG
CTGAGCCAGG TGCCGGGCAC GGCCGTGCCC CTCACCATCC CGCTCACGAT GGGCTCGCCG
GTCGGCACCA AGGCCGTCAG GATGACGGTG GACCTGCCGA CGGATGACGC CATGCTGGGC
CAGCAGGCGG CGGTGCCTCC GGCTGCGGCC TTCGACGCGG CCGACCCCAC CACCTATGCC
GCCGTCACGG CGATCCCGGT CTTCGATGCG AAGGGCAATG CGGTCGAGGC GGCGGCCTAT
TTCATCAAGA CCGAGAACCC CGCGGCGGGC AGCCCGGACA CGGGCTGGGC GGTGCGGCTC
GTCGTCGCCG GCGAGACGCT GACGCCGGCC GAGGGCGATC TCGCCTTTGA CGCGACCGGC
GCGCTGGCCG GGGGCACCGG CAGCCTCAGC TTCACCACCG GCATCGGCAC GCCCTACACG
CTCGATCTGA CCGGCACCGC GCTGACCAAC CGCAGCTTCG AGGTCAACAC CGTCAACCAG
GACGGCAAGA GCGCGGCCGC GCTGACCAGC CTCGAGGTGG ATGCCAGCGG CACGGTCTGG
GCGGCCTACG GCGCCGGCGA CTCCGTCGCC ATGGGGCAGG TGGTGCTGGT GACCTTCGCC
AACCCGCAGG CGCTGCGCCA GCTCGGCGCC TCGGGCTTCG CGGCCACCGC CGATTCCGGC
CAGCCCGTCG CGGGCACGGC GGGCGACTCG GGCTTCGGGA TCATCCGGGC CGGCGCGCTC
GAACATGCCA ACGTCGATCT GACCGAGGAA CTCGTCCATC TGATCACCGC GCAGCGCAAC
TACCAGGCCT CGGCCAAGGC GATGGAGACC TCGAACTCGC TGATGCAGAC GATCATGAAC
ATCCGCAGCT GA
 
Protein sequence
MSINTALSGL SAAQHDIAAT SHNIANVGTI GFRGSRAEFA DVFNSSPYSI ARTAVGSGVQ 
TLRTAMQFSQ GSVVATGNTL DLAIEGQGFF ATEPAVGPNS AKPEPIYTRA GAFGLNDKGV
AVNASGQKLL AWPVSVEGDA LSQVPGTAVP LTIPLTMGSP VGTKAVRMTV DLPTDDAMLG
QQAAVPPAAA FDAADPTTYA AVTAIPVFDA KGNAVEAAAY FIKTENPAAG SPDTGWAVRL
VVAGETLTPA EGDLAFDATG ALAGGTGSLS FTTGIGTPYT LDLTGTALTN RSFEVNTVNQ
DGKSAAALTS LEVDASGTVW AAYGAGDSVA MGQVVLVTFA NPQALRQLGA SGFAATADSG
QPVAGTAGDS GFGIIRAGAL EHANVDLTEE LVHLITAQRN YQASAKAMET SNSLMQTIMN
IRS