Gene EcolC_2520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2520 
SymbolflgI 
ID6067381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2772537 
End bp2773634 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content54% 
IMG OID641601926 
Productflagellar basal body P-ring protein 
Protein accessionYP_001725478 
Protein GI170020524 
COG category[N] Cell motility 
COG ID[COG1706] Flagellar basal-body P-ring protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00323663 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGATTAAAT TTCTCTCTGC ATTAATTCTT CTACTGGTCA CGACGGCGGC TCAGGCTGAA 
CGTATTCGCG ATCTCACCAG TGTTCAGGGG GTAAGGCAAA ACTCACTGAT TGGCTATGGT
CTGGTGGTGG GGCTGGATGG CACCGGTGAC CAGACAACCC AGACGCCGTT TACCACACAA
ACGCTTAATA ACATGCTCTC ACAGCTGGGA ATTACCGTTC CGACGGGCAC CAATATGCAG
CTAAAAAACG TCGCTGCGGT AATGGTGACA GCGTCACTTC CTCCGTTTGG ACGTCAGGGG
CAAACCATCG ATGTGGTGGT TTCTTCCATG GGTAATGCCA AAAGCTTGCG TGGAGGTACG
TTGTTGATGA CACCGCTTAA GGGCGTTGAC AGTCAGGTGT ATGCGCTGGC GCAGGGCAAT
ATTCTGGTTG GCGGCGCAGG AGCCTCCGCT GGCGGTAGCA GTGTTCAGGT TAACCAACTG
AACGGTGGAC GGATCACCAA TGGTGCGGTT ATTGAGCGTG AATTGCCCAG CCAGTTTGGC
GTCGGGAATA CCCTTAATTT GCAACTTAAC GACGAAGATT TCAGCATGGC GCAGCAAATC
GCTGATACCA TCAACCGCGT GCGTGGATAT GGCAGCGCCA CCGCGTTAGA TGCGCGGACT
ATTCAGGTGC GCGTACCGAG TGGCAACAGT TCCCAGGTCC GCTTCCTTGC CGATATTCAG
AATATGCATG TTAATGTCAC CCCGCAGGAC GCTAAAGTAG TGATTAACTC GCGCACCGGT
TCGGTGGTGA TGAATCGCGA AGTGACCCTC GACAGCTGCG CGGTAGCGCA GGGGAATCTC
TCAGTAACAG TTAATCGTCA GGCCAATGTC AGCCAGCCAG ATACACCGTT TGGTGGTGGA
CAGACTGTGG TTACTCCACA AACGCAGATC GATTTACGCC AGAGCGGCGG TTCGCTGCAA
AGCGTACGTT CCAGCGCCAG CCTCAATAAC GTGGTGCGCG CGCTCAATGC GCTGGGCGCT
ACGCCGATGG ATCTGATGTC CATACTGCAA TCAATGCAAA GTGCGGGATG TCTGCGGGCA
AAACTGGAAA TCATCTGA
 
Protein sequence
MIKFLSALIL LLVTTAAQAE RIRDLTSVQG VRQNSLIGYG LVVGLDGTGD QTTQTPFTTQ 
TLNNMLSQLG ITVPTGTNMQ LKNVAAVMVT ASLPPFGRQG QTIDVVVSSM GNAKSLRGGT
LLMTPLKGVD SQVYALAQGN ILVGGAGASA GGSSVQVNQL NGGRITNGAV IERELPSQFG
VGNTLNLQLN DEDFSMAQQI ADTINRVRGY GSATALDART IQVRVPSGNS SQVRFLADIQ
NMHVNVTPQD AKVVINSRTG SVVMNREVTL DSCAVAQGNL SVTVNRQANV SQPDTPFGGG
QTVVTPQTQI DLRQSGGSLQ SVRSSASLNN VVRALNALGA TPMDLMSILQ SMQSAGCLRA
KLEII