Gene ECH74115_2699 details


Gene Information

Locus tag: ECH74115_2699
Symbol: fliD
ID: 6970253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2534013
End bp: 2535410
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 52%
IMG OID: 643386560
Product: flagellar capping protein
Protein accession: YP_002271039
Protein GI: 209395702
COG category: [N] Cell motility
COG ID: [COG1345] Flagellar capping protein
TIGRFAM ID:
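
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2534013, 2535410)
print(length)                     # 1398
print(protein_length_aa(length))  # 465
```

Both values agree with the Gene Length and Protein Length fields of this record.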


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.171502
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGTA TTTCATCGCT GGGAGTCGGG TCAGGTCTGG ATTTAAGTTC AATCCTTGAT 
AGCCTCACCG CCGCGCAAAA AGCGACGCTA ACCCCCATTT CAAATCAGCA ATCGTCGTTT
ACCGCTAAAC TTAGCGCCTA CGGTACGCTG AAAAGCGCGC TGACGACTTT CCAGACCGCC
AATACTGCAT TGTCTAAAGC CGATCTTTTT TCCGCCACCA GCAGCACCAC CGCGTTCAGT
GCCACCACCG CGGGTAACGC CATCGCCGGG AAATACACCA TCAGCGTCAC CCATCTGGCG
CAGGCGCAAA CCCTGACCAC GCGCACCACC AGAGACGATA CGAAAACGGC GATCGCCACC
AGCGACAGTA AACTCACCAT TCAACAAGGC GGCGACAAAG ATCCGATTAC CATTGATATC
AGCGCGGCTA ACTCGTCGTT AAGCGGGATC CGTGATGCCA TCAACAACGC AAAAGCAGGT
GTTAGCGCGA GCATTATTAA CGTGGGTAAC GGTGATTATC GTCTGTCAGT CACATCAAAT
GACACCGGCC TTGACAATGC GATGACGCTC TCGGTTAGCG GTGATGATGC GCTACAAAGT
TTTATGGGCT ATGACGCCAG TGCCAGCAGC AACGGCATGG AGGTCTCGGT CGCCGCCCAG
AATGCGCAGC TGACGGTCAA CAACGTCGCC ATTGAGAACA GCAGCAACAC CATCAGCGAC
GCGCTGGAAA ACATCACCCT GAACCTGAAC GATGTCACCA CGGGCAACCA GACGCTAACC
ATCACTCAGG ACACCTCCAA AGCGCAAACG GCGATTAAAG ATTGGGTGAA TGCCTATAAC
TCGCTAATAG ATACCTTCAG CAGCCTGACC AAATACACCG CCGTAGATGC GGGAGCTGAT
AGCCAGAGTT CTAGCAATGG CGCACTGCTC GGCGACTCCA CGCTGCGGAC GATTCAGACG
CAGTTGAAGT CGATGCTGAG TAATACCGTC AGTTCTTCCA GCTATAAAAC GTTGGCGCAG
ATTGGTATCA CGACCGATCC CAGCGATGGC AAACTGGAAC TGGATGCCGA CAAACTCACC
GCTGCACTGA AAAAAGATGC CAGCGGCGTA GGTGCATTGA TTGTTGGCGA TGGTAAAAAA
ACCGGCATCA CGACCACCAT CGGCAGCAAC CTGACCAGTT GGCTTTCGAC AACGGGCATT
ATTAAAGCCG CTACCGATGG CGTTAGTAAG ACCCTGAATA AATTAACTAA AGACTACAAC
GCCGCCAGCG ATCGCATTGA CGCGCAGGTC GCGCGCTACA AAGAACAATT TACCCAACTG
GACGTTTTAA TGACCTCGTT AAACAGCACA AGCAGCTACT TAACGCAGCA GTTCGAAAAC
AACAGTAATT CCAAGTAA
 
Protein sequence
MASISSLGVG SGLDLSSILD SLTAAQKATL TPISNQQSSF TAKLSAYGTL KSALTTFQTA 
NTALSKADLF SATSSTTAFS ATTAGNAIAG KYTISVTHLA QAQTLTTRTT RDDTKTAIAT
SDSKLTIQQG GDKDPITIDI SAANSSLSGI RDAINNAKAG VSASIINVGN GDYRLSVTSN
DTGLDNAMTL SVSGDDALQS FMGYDASASS NGMEVSVAAQ NAQLTVNNVA IENSSNTISD
ALENITLNLN DVTTGNQTLT ITQDTSKAQT AIKDWVNAYN SLIDTFSSLT KYTAVDAGAD
SQSSSNGALL GDSTLRTIQT QLKSMLSNTV SSSSYKTLAQ IGITTDPSDG KLELDADKLT
AALKKDASGV GALIVGDGKK TGITTTIGSN LTSWLSTTGI IKAATDGVSK TLNKLTKDYN
AASDRIDAQV ARYKEQFTQL DVLMTSLNST SSYLTQQFEN NSNSK
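
The gene and protein sequences above are printed in space-separated blocks of ten. As a rough sketch of how the two can be cross-checked, the code below strips the layout whitespace, computes the GC fraction, and translates a CDS. It assumes the amino-acid assignments of translation table 11 match the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names are hypothetical, and the translation stops at the first stop codon.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position, last base varying fastest)
# paired with the amino-acid string shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 18 nt of the gene sequence in this record.
head = clean_sequence("ATGGCAAGTA TTTCATCGCT GGGAGTCGGG")
tail = clean_sequence("AACAGTAATT CCAAGTAA")
print(translate_cds(head))  # MASISSLGVG
print(translate_cds(tail))  # NSNSK
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.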