Gene EcolC_1716 details

Gene Information

Locus tag: EcolC_1716
Symbol:
ID: 6065051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1912864
End bp: 1914582
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 49%
IMG OID: 641601128
Product: flagellin
Protein accession: YP_001724693
Protein GI: 170019739
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
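
The coordinate and length fields in this record are consistent with each other, and a short check makes the relationship explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `span_length` and `coding_codons` are hypothetical, introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1912864
end_bp = 1914582
gene_length_bp = 1719
protein_length_aa = 572


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons print True for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

If the coordinates were 0-based or half-open instead, the first check would be off by one, so it also serves as a quick test of the coordinate convention.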


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.363061
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAAG TCATTAATAC CAACAGCCTC TCGCTGATCA CTCAAAATAA TATCAACAAG 
AACCAGTCTG CGCTGTCGAG TTCTATCGAG CGTCTGTCTT CTGGCTTGCG TATTAACAGC
GCGAAGGATG ACGCAGCGGG TCAGGCGATT GCTAACCGTT TTACTTCTAA CATTAAAGGC
CTGACTCAGG CTGCACGTAA CGCCAATGAC GGTATTTCTG TTGCACAGAC CACTGAAGGC
GCGCTGTCCG AAATCAACAA CAACTTACAG CGTGTGCGTG AACTGACCGT TCAGGCGACC
ACCGGTACCA ACTCCCAGTC TGATCTGGAC TCTATCCAGG ACGAAATCAA ATCCCGTTTG
GACGAAATTG ACCGCGTATC TGGTCAGACT CAGTTCAACG GCGTGAACGT ACTGGCAAAA
GACGGTTCGA TGAAAATTCA GGTTGGCGCG AATGATGGCC AGACCATCAC TATCGACCTG
AAGAAGATTG ACTCTTCTAC GTTGAAACTG ACTGGTTTTA ACGTGAATGG TTCTGGTTCT
GTGGCGAATA CTGCGGCGAC TAAAGCTGAT TTGGCTGCTG CTGCAATTGG TACCCCTGGG
GCAGCAGATT CTACAGGTGC CATTGCTTAC ACAGTAAGTG CTGGGCTGAC TAAAACTACA
GCCGCAGATG TACTGTCTAG CCTCGCTGAT GGTACGACTA TTACAGCCAC AGGCGTGAAA
AATGGCTTTG CTGCAGGAGC CACTTCCAAT GCCTATAAAC TTAACAAAGA TAATAATACA
TTTACTTATG ACACGACTGC TACGACAGCT GAGCTGCAGT CTTACCTGAC TCCGAAAGCG
GGCGACACTG CAACATTCAG TGTTGAAATT GGTGGTACTA CACAAGACGT CGTGCTGTCC
AGTGATGGCA AACTCACTGC TAAGGATGGC TCTAAGCTTT ACATTGATAC AACTGGTAAC
TTAACTCAGA ATGGTGGTAA TAACGGTGTT GGAACACTCG CGGAAGCGAC TCTGAGTGGT
TTAGCTCTGA ACAACAATAA TGGTGCAGCG GCTGTTAAAT CCACAATTAC TACAGCTGAT
AACACTTCGA TTGTACTGAA TGGTTCAAGC GATGGTACTG AAGGTACGAT TGCTGTTACA
GGCGCTGTAA TTAGTTCAGC TGCTCTGCAA TCTGCAAGCA AAACGACTGG TTTCACTGTT
GGTACAGCAG ACACAGCTGG TTATATCTCT GTAGGTACTG ATGGGAGTGT TCAGGCATAT
GATGTTGCGA CTTCTGGCAA CAAAGATTCT TACACCAACA CTGACGGTAC ACTGACTACT
GATAACACCA CTAAACTGTA TCTGCAGAAA GATGGCTCTG TAACCAACGG TTCAGGTAAA
GCGGTCTATG TAGAAGCGGA TGGTGATTTC ACTACCGACG CTGCAACCAA AGCCGCAACC
ACCACCGATC CGCTGGCCGC TCTGGATGAC GCAATCAGCC AGATCGACAA GTTCCGTTCA
TCCTTGGGTG CTATCCAGAA CCGTCTGGAT TCCGCAGTCA CCAACCTGAA CAACACCACT
ACCAACCTGT CCGAAGCGCA GTCCCGTATT CAGGACGCCG ACTATGCGAC CGAAGTGTCC
AACATGTCGA AAGCGCAGAT TATTCAGCAG GCAGGTAACT CCGTGCTGGC AAAAGCTAAC
CAGGTACCGC AGCAGGTTCT GTCTCTGCTG CAGGGTTAA
 
Protein sequence
MAQVINTNSL SLITQNNINK NQSALSSSIE RLSSGLRINS AKDDAAGQAI ANRFTSNIKG 
LTQAARNAND GISVAQTTEG ALSEINNNLQ RVRELTVQAT TGTNSQSDLD SIQDEIKSRL
DEIDRVSGQT QFNGVNVLAK DGSMKIQVGA NDGQTITIDL KKIDSSTLKL TGFNVNGSGS
VANTAATKAD LAAAAIGTPG AADSTGAIAY TVSAGLTKTT AADVLSSLAD GTTITATGVK
NGFAAGATSN AYKLNKDNNT FTYDTTATTA ELQSYLTPKA GDTATFSVEI GGTTQDVVLS
SDGKLTAKDG SKLYIDTTGN LTQNGGNNGV GTLAEATLSG LALNNNNGAA AVKSTITTAD
NTSIVLNGSS DGTEGTIAVT GAVISSAALQ SASKTTGFTV GTADTAGYIS VGTDGSVQAY
DVATSGNKDS YTNTDGTLTT DNTTKLYLQK DGSVTNGSGK AVYVEADGDF TTDAATKAAT
TTDPLAALDD AISQIDKFRS SLGAIQNRLD SAVTNLNNTT TNLSEAQSRI QDADYATEVS
NMSKAQIIQQ AGNSVLAKAN QVPQQVLSLL QG