Gene EcE24377A_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3034 
Symbol 
ID5587096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3038216 
End bp3039478 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content55% 
IMG OID640926680 
ProductYgbK domain-containing protein 
Protein accessionYP_001464056 
Protein GI157158596 
COG category[S] Function unknown 
COG ID[COG3395] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01586] cysteine protease domain, YopT-type 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAAGA TTGGCGTTAT CGCCGATGAT TTTACCGGCG CGACGGATAT CGCCAGTTTT 
CTGGTGGAAA ACGGTCTACC AACGGTACAA ATTAACGGTG TTCCAACAGG TAAAATGCCG
GAAGCAATCG ACGCACTGGT GATCAGCCTG AAAACGCGCT CCTGTCCGGT GGTTGAAGCC
ACACAGCAAT CGCTGGCGGC TCTGAGCTGG TTGCAACAGC AAGGTTGCAA ACAGATCTAT
TTCAAATACT GCTCTACTTT CGACAGTACG GCGAAAGGTA ATATCGGCCC GGTTACCGAT
GCATTAATGG ATGCTCTCGA CACGCCGTTT ACGGTCTTCT CTCCGGCCCT GCCGGTCAAC
GGACGTACGG TTTATCAGGG GTATTTGTTC GTAATGAATC AACTGCTGGC CGAATCCGGG
ATGCGCCATC ACCCGGTAAA TCCCATGACC GACAGCTATC TTCCCCGTCT GGTTGAAGCG
CAATCCACAG GGCTCTGCGG CGTCGTTTCG GCACATGTTT TCGAACAAGG TGTGGATGCC
GTTCGTCAAG AGCTGGCTCG CTTACAGCAA GAGGGCTACC GCTACGCGGT GCTTGATGCG
CTGACCGAAC ACCATCTGGA AATTCAGGGA GAAGCCTTGC GCGATGCCCC ACTGGTAACG
GGTGGTTCTG GTCTGGCGAT TGGCCTGGCC CGGCAGTGGG CGCAAGAAAA CGGTAACCAG
GCTCGCGAAG CAGGGCGTCC GCTCGCTGGG CGCGGCGTAG TGCTCTCCGG TTCATGCTCT
CAAATGACCA ACCGCCAGGT GGCACATTAC CGTCAAATTG CACCAGCCCG TGAAGTTGAT
GTGGCACGCT GCCTCTCAAC CGAAACTCTG GCCGCTTATG CACACGAACT GGCAGAGTGG
GTTCTGGGCC AGGAAAGTGT ACTTGCTCCA CTGGTTTTTG CCACCGCCAG CACTGACGCA
TTGGCAGCAA TTCAACAGCA ATACGGTGCA CAAAAAGCCA GTCAGGCAGT AGAAACTCTA
TTTTCTAAAC TAGCGGCGCG GTTAGCAGCG GAAGGCGTGA CACGCTTTAT TGTCGCAGGC
GGTGAGACCT CCGGCGTAGT CACACAGAGC CTGGGCATAA AAGGGTTTCA TATTGGCCCA
ACCATTTCCC CCGGCGTGCC GTGGGTAAAC GCACTGGATA AGCCTGTCTC ACTCGCCCTT
AAATCTGGCA ACTTCGGTGA TGAAGCCTTT TTTTCACGAG CCCAAAGAGA GTTTTTATCA
TGA
 
Protein sequence
MIKIGVIADD FTGATDIASF LVENGLPTVQ INGVPTGKMP EAIDALVISL KTRSCPVVEA 
TQQSLAALSW LQQQGCKQIY FKYCSTFDST AKGNIGPVTD ALMDALDTPF TVFSPALPVN
GRTVYQGYLF VMNQLLAESG MRHHPVNPMT DSYLPRLVEA QSTGLCGVVS AHVFEQGVDA
VRQELARLQQ EGYRYAVLDA LTEHHLEIQG EALRDAPLVT GGSGLAIGLA RQWAQENGNQ
AREAGRPLAG RGVVLSGSCS QMTNRQVAHY RQIAPAREVD VARCLSTETL AAYAHELAEW
VLGQESVLAP LVFATASTDA LAAIQQQYGA QKASQAVETL FSKLAARLAA EGVTRFIVAG
GETSGVVTQS LGIKGFHIGP TISPGVPWVN ALDKPVSLAL KSGNFGDEAF FSRAQREFLS