Gene EcolC_0975 details


Gene Information

Locus tag: EcolC_0975
Symbol:
ID: 6067922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1060288
End bp: 1061550
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 54%
IMG OID: 641600383
Product: type III effector Hrp-dependent outers
Protein accession: YP_001723971
Protein GI: 170019017
COG category: [S] Function unknown
COG ID: [COG3395] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01586] cysteine protease domain, YopT-type
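
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1060288, 1061550)
print(length)                     # 1263
print(protein_length_aa(length))  # 420
```

Under those assumptions both values agree with the Gene Length and Protein Length fields of this record.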


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.847022
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.745569
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAGA TTGGCGTTAT CGCCGATGAT TTTACCGGCG CGACGGATAT CGCCAGTTTT 
CTGGTGGAAA ACGGTCTACC AACGGTACAA ATTAACGGTG TTCCAACAGG TAAAATGCCG
GAAGCAATCG ACGCACTGGT GATCAGCCTG AAAACGCGCT CCTGTCCAGT GGTTGAAGCC
ACACAGCAAT CGCTGGCGGC TCTGAGCTGG TTGCAACAGC AAGGTTGCAA ACAGATCTAT
TTCAAATACT GCTCTACTTT CGACAGTACG GCGAAAGGTA ATATTGGCCC GGTTACCGAT
GCCTTAATGG ATGCTCTCGA CACGCCGTTT ACGGTCTTCT CTCCGGCCCT GCCGGTCAAC
GGACGTACGG TTTATCAGGG GTATTTGTTC GTAATGAATC AACTGCTGGC CGAATCCGGG
ATGCGCCATC ACCCGGTAAA TCCCATGACC GACAGCTATC TTCCCCGTCT GGTTGAAGCG
CAATCCACAG GGCGCTGCGG CGTCGTTTCG GCACATGTTT TCGAACAAGG TGTGGATGCC
GTTCGTCAAG AGCTGGCTCG CTTACAGCAA GAGGGCTACC GCTACGCGGT GCTTGATGCG
CTGACCGAAC ACCATCTGGA AATTCAGGGA GAAGCCTTGC GCGATGCCCC ACTGGTAACG
GGCGGTTCTG GTCTGGCGAT TGGCCTGGCC CGGCAGTGGG CGCAAGAAAA CGGTAACCAG
GCTCGCAAAG CAGGGCGTCC GCTCGCTGGG CGCGGCGTAG TGCTCTCCGG TTCATGCTCT
CAAATGACCA ACCGCCAGGT AGCACATTAC CGTCAAATTG CACCAGCCCG TGAAGTTGAT
GTGGCACGCT GCCTCTCAAT TGAAACTCTG GCCGCTTATG CACACGAACT GGCAGAGTGG
GTTCTGGGCC AGGAAAGTGT ACTTGCTCCA CTGGTTTTTG CCACCGCCAG CACTGACGCA
TTGGCAGCAA TTCAACAGCA ATACGGTGCA CAAAAAGCCA GTCAGGCAGT AGAAACACTG
TTTTCTCAAC TAGCGGCGCG GTTAGCAGCG GAAGGCGTGA CACGCTTTAT TGTCGCAGGC
GGTGAGACCT CCGGCGTAGT CACACAGAGC CTGGGAATAA AAGGGTTTCA TATTGGCCCA
ACCATTTCCC CCGGCGTGCC GTGGGTAAAC GCACTGGATA AGCCTGTCTC ACTCGCCCTT
AAATCTGGCA ACTTCGGTGA TGACGCCTTT TTTTCACGAG CCCAAAGAGA GTTTTTATCA
TGA
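
The GC content field of this record can in principle be recomputed from the gene sequence. The sketch below is one hedged way to do that in Python, assuming the sequence is pasted as text with the 10-base blocks separated by whitespace. `gc_percent` is a hypothetical helper, and how the database rounds its reported percentage is not stated on this page.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage: paste the full gene sequence in place of this short stand-in.
first_block = "ATGATCAAGA"
print(round(gc_percent(first_block), 1))
```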
 
Protein sequence
MIKIGVIADD FTGATDIASF LVENGLPTVQ INGVPTGKMP EAIDALVISL KTRSCPVVEA 
TQQSLAALSW LQQQGCKQIY FKYCSTFDST AKGNIGPVTD ALMDALDTPF TVFSPALPVN
GRTVYQGYLF VMNQLLAESG MRHHPVNPMT DSYLPRLVEA QSTGRCGVVS AHVFEQGVDA
VRQELARLQQ EGYRYAVLDA LTEHHLEIQG EALRDAPLVT GGSGLAIGLA RQWAQENGNQ
ARKAGRPLAG RGVVLSGSCS QMTNRQVAHY RQIAPAREVD VARCLSIETL AAYAHELAEW
VLGQESVLAP LVFATASTDA LAAIQQQYGA QKASQAVETL FSQLAARLAA EGVTRFIVAG
GETSGVVTQS LGIKGFHIGP TISPGVPWVN ALDKPVSLAL KSGNFGDDAF FSRAQREFLS
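
The relationship between the gene sequence and the protein sequence can be illustrated by translating the opening bases of the gene. This is a minimal Python sketch, not the database's own pipeline. It builds the codon table from the NCBI amino-acid string, which is the same for translation table 11 and the standard code (the two differ only in permitted start codons). The `translate` helper is hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGATCAAGA TTGGCGTTAT CGCCGATGAT TTTACCGGCG CGACGGATAT CGCCAGTTTT"
print(translate(first_line))  # MIKIGVIADDFTGATDIASF
```

The result matches the first 20 residues of the protein sequence above (MIKIGVIADD FTGATDIASF).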