Gene EcHS_A2041 details

Gene Information

Locus tag: EcHS_A2041
Symbol: fliI
ID: 5594019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2036294
End bp: 2037667
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 58%
IMG OID: 640921185
Product: flagellum-specific ATP synthase
Protein accession: YP_001458730
Protein GI: 157161412
COG category: [N] Cell motility
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1157] Flagellar biosynthesis/type III secretory pathway ATPase
TIGRFAM ID: [TIGR01026] ATPase FliI/YscN family
    [TIGR03496] flagellar protein export ATPase FliI
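
The start, end, gene length, and protein length values in this record agree with one another, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a coding sequence that ends in one stop codon not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2036294, 2037667)
print(length)                     # 1374
print(protein_length_aa(length))  # 457
```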


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACGC GCCTGACTCG CTGGCTAACC ACGCTGGATA ACTTCGAAGC TAAAATGGCG 
CAGTTGCCTG CGGTACGTCG CTACGGGCGA TTAACTCGCG CTACCGGGCT GGTGCTGGAA
GCCACCGGAT TACAATTGCC GCTCGGCGCA ACCTGTGTGA TTGAGCGCCA GAACGGCAGC
GAAACGCACG AAGTAGAAAG CGAAGTCGTT GGCTTTAACG GTCAACGGCT GTTTTTAATG
CCGCTGGAGG AAGTGGAAGG TGTCCTGCCC GGCGCGCGTG TTTATGCCAA AAACATTTCG
GCTGAAGGGC TGCAAAGCGG CAAGCAGTTG CCGCTCGGTC CGGCGTTATT AGGTCGCGTT
CTGGACGGCA GCGGTAAACC GCTCGATGGC CTCCCCTCCC CCGATACGAC GGAAACCGGT
GCGCTGATTA CCCCGCCATT TAACCCATTG CAACGTACAC CGATTGAACA TGTGCTGGAC
ACCGGCGTGC GCCCAATCAA TGCCCTGCTT ACCGTCGGGC GCGGGCAGCG TATGGGGCTG
TTTGCCGGGT CCGGCGTTGG TAAAAGTGTG CTGCTGGGGA TGATGGCCCG TTACACCCGC
GCCGATGTCA TTGTCGTGGG TTTGATTGGT GAACGCGGAC GCGAAGTAAA AGATTTTATT
GAGAATATCC TCGGTGCCGA AGGGCGTGCA CGCTCCGTGG TGATTGCCGC TCCGGCGGAT
GTTTCTCCGC TTCTGCGAAT GCAAGGTGCC GCCTATGCCA CGCGAATTGC CGAAGATTTT
CGCGATCGTG GTCAGCATGT ATTGCTGATT ATGGACTCCC TCACCCGCTA CGCGATGGCC
CAGCGTGAGA TTGCGCTGGC GATTGGCGAA CCACCTGCCA CTAAAGGTTA TCCACCGTCG
GTGTTTGCCA AATTACCGGC ACTGGTCGAG CGTGCCGGAA ATGGCATTAG CGGCGGCGGC
TCGATTACCG CGTTTTATAC CGTGCTCACC GAAGGCGATG ACCAGCAGGA CCCCATTGCC
GACTCCGCGC GGGCCATCCT CGACGGCCAC ATTGTGCTGT CTCGCCGACT GGCGGAAGCC
GGGCACTATC CGGCTATCGA TATTGAAGCG TCGATCAGTC GCGCAATGAC GGCGTTGATC
AGTGAGCAAC ATTACGCGCG AGTGCGCACC TTCAAACAGC TGTTGTCGAG TTTTCAGCGT
AACCGCGATC TGGTTAGCGT CGGCGCGTAT GCCAAAGGCA GCGATCCGAT GCTCGATAAA
GCCATCGCCC TGTGGCCGCA GCTGGAGGGC TATTTGCAAC AAGGCATTTT TGAACGCGCG
GACTGGGAAG CGTCTCTCCA GGGGCTGGAG CGTATTTTCC CGACAGTGTC ATAA
 
Protein sequence
MTTRLTRWLT TLDNFEAKMA QLPAVRRYGR LTRATGLVLE ATGLQLPLGA TCVIERQNGS 
ETHEVESEVV GFNGQRLFLM PLEEVEGVLP GARVYAKNIS AEGLQSGKQL PLGPALLGRV
LDGSGKPLDG LPSPDTTETG ALITPPFNPL QRTPIEHVLD TGVRPINALL TVGRGQRMGL
FAGSGVGKSV LLGMMARYTR ADVIVVGLIG ERGREVKDFI ENILGAEGRA RSVVIAAPAD
VSPLLRMQGA AYATRIAEDF RDRGQHVLLI MDSLTRYAMA QREIALAIGE PPATKGYPPS
VFAKLPALVE RAGNGISGGG SITAFYTVLT EGDDQQDPIA DSARAILDGH IVLSRRLAEA
GHYPAIDIEA SISRAMTALI SEQHYARVRT FKQLLSSFQR NRDLVSVGAY AKGSDPMLDK
AIALWPQLEG YLQQGIFERA DWEASLQGLE RIFPTVS
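
The gene and protein listings can be cross-checked with a minimal sketch that strips the 10-character grouping, computes GC content, and translates codon by codon. It assumes that translation table 11 assigns the same amino acids to internal codons as the standard code, and that the reading frame starts at the first base. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Only the first row of the gene sequence is used here, so the GC value for the full gene is not reproduced.

```python
# Standard codon assignments in NCBI order (T, C, A, G). Translation
# table 11 is assumed to share these assignments for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned, non-empty DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_row = clean(
    "ATGACCACGC GCCTGACTCG CTGGCTAACC ACGCTGGATA ACTTCGAAGC TAAAATGGCG"
)
print(translate(first_row))  # MTTRLTRWLTTLDNFEAKMA
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein Length and GC content fields.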