Gene EcolC_4013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4013 
Symbol 
ID6064568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4411436 
End bp4413010 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content37% 
IMG OID641603424 
ProductShET2 enterotoxin domain-containing protein 
Protein accessionYP_001726939 
Protein GI170021985 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0223767 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACTC GCATCCCTCG TAGTTCTTTC TCTGCAAATA TTAATAATAC AGCCCAGACA 
AATGAACACC AAACCCTGAG TGAATTGTTT TATAAAGAAC TCGAGGATAA ATTTTCTGGC
AAGGAGCTGG CGACGCCTCT ATTAAAAAGC TTCTCAGAGA ACTGTAGACA AAATGGTCGT
CATATCTTTA GCAACAAGGA TTTTGTCATT AAATTTTCCA CGTCCGTCTT ACAAGCTGAT
AAGAAAGAAA TTACGATAAT TAATAAAAAC GAAAACACGA CACTTACTCA AACCATTGCC
CCAATATTTG AAGAATACCT AATGGAAATT TTACCTCAAC GCTCAGACAC TCTTGATAAA
CAAGAATTAA ACCTAAAATC AGATAGAAAA GAAAAAGAAT TCCCAAGAAT TAAACTTAAT
GGTCAATGTT ATTTTCCGGG GCGACCCCAA AACCGTATAG TATGCCGACA CATTGCTGCA
CAATATATTA ATGATATTTA TCAGAATGTT GATTACAAAC CCCATCAAGA TGATTACTCT
TCAGCTGAAA AATTTCTCAC GCACTTCAAC AAAAAATGCA AAAACCAGAC TTTGGCGTTG
GTTTCCAGCC GTCCTGAGGG GCGTTGCGTT GCTGCCTGCG GTGATTTCGG GCTAGTTATG
AAAGCATATT TTGACAAGAT GGAATCAAAT GGCATCAGTG TTATGGCAGC CATATTACTG
GTGGATAACC ATGCTTTGAC GGTCCGGCTA AGAATAAAGA ACACAACTGA AGGATGTACC
CATTACGTGG TTTCGGTTTA TGATCCTAAT GTAACTAACG ATAAAATAAG AATTATGAGC
GAAAGCAAAG AGGATATTAA ACACTATTCT CTGATGGATT TTATGAATGT AGATTATAGC
CTCCTGAAAT GGTCAAATGA TCATGTTATT AACCAATCTG TTGCAATAAT TCCAGCACTT
CCGAAAGAAC AGCTATTGAT GTTAAAAGGA TCTGTGGATG AAATAACCCC TCCATTATCA
CCAGCAACGA TGAATTTGCT AATGGCAATT GGTCAGAATC ACCAACTTAC GCAACTGATG
ATTCAGCTCC AGAAAATGCC AGAACTACAT AGAACAGAAA TGTTGACTGC CTATAATAGT
GGACATATGA ACGTTATTAA TACTATTTTT AACGCATTAC CCACTCTGTT TAATACGTTT
AAATTCGATA AAAAAAATAT GAAGCCCCTC CTCCTGGCAA ATAATTCTAA TGAATATCCC
GGTTTGTTTT CAGCGATACA GCATAAACAA CAAAATGTTG TAGAGACGGT TTATCTTGCT
TTATCTAACC ATGCACGCCT GTTTGGATTT ACCGCTGAAG ATATTATGGA TTTTTGGCAA
CACAAAGCCC CACAAAAATA CTCTGCCTTT GAGTTGGCTT TTGAATTGGG TCACCGGGTT
ATTGCTGAAT TAATCCTTAA TACATTAAAT AAGATGGCTG AAAGCTTTGG CTTTACGGAT
AACCCTCGAT ACATTGCGGA GAAAAATTAT ATGGAAGCTT TACTCAAAAA AGCATCTCCC
CATACCGTAC GCTAA
 
Protein sequence
MITRIPRSSF SANINNTAQT NEHQTLSELF YKELEDKFSG KELATPLLKS FSENCRQNGR 
HIFSNKDFVI KFSTSVLQAD KKEITIINKN ENTTLTQTIA PIFEEYLMEI LPQRSDTLDK
QELNLKSDRK EKEFPRIKLN GQCYFPGRPQ NRIVCRHIAA QYINDIYQNV DYKPHQDDYS
SAEKFLTHFN KKCKNQTLAL VSSRPEGRCV AACGDFGLVM KAYFDKMESN GISVMAAILL
VDNHALTVRL RIKNTTEGCT HYVVSVYDPN VTNDKIRIMS ESKEDIKHYS LMDFMNVDYS
LLKWSNDHVI NQSVAIIPAL PKEQLLMLKG SVDEITPPLS PATMNLLMAI GQNHQLTQLM
IQLQKMPELH RTEMLTAYNS GHMNVINTIF NALPTLFNTF KFDKKNMKPL LLANNSNEYP
GLFSAIQHKQ QNVVETVYLA LSNHARLFGF TAEDIMDFWQ HKAPQKYSAF ELAFELGHRV
IAELILNTLN KMAESFGFTD NPRYIAEKNY MEALLKKASP HTVR