Gene EcHS_A1267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1267 
Symbol 
ID5595496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1268050 
End bp1269570 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content38% 
IMG OID640920427 
Producthypothetical protein 
Protein accessionYP_001457989 
Protein GI157160671 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.0117469 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTGA AAAAACTCTC CGGGTTTAGT TTGGGACTTA TTGCTCTGGC GGTGGGTAAT 
GCATATGCAA CACAATTGTT GGATGATTAT AGTATAATTT CCTATATGAC TGATGAAGAA
TCGCCGATTG AAATCAAAGA TAATAATCCG ATAAGTAATG GAGAGTATCT AACCACTGAA
GACGAAAGCC ATGCTGTGAA AGTGGATGAC GGTGTAACTG GATATATAAA TAATGCCAGT
GTGATGACTA GTGGTGATGG ATCTTATGGT ATTTCTGTTG ATAGTCAAAA CAAAGTATTA
TATATAAGCG ATAGCGATAT TAAGACCTCT GGAAGCGTAT CTGACAAAGA AAATGGAGGG
ATAACAGCCA GCGCAGTAGT CAGTGAATTT GGTGGCACCA TCTTTATGAA TGGTGATAAT
TCAGTCGAGT CGGGTGGGGC ATATTCAGCG GGACTTTTAA GCCAGGTTAA TGATTCTGAA
AAGATGGTAA ATAACACCCG TCTTGAAACC ACAGATAAAA CGAACATTGT TACCTCTGGG
GAAAATGCAG TAGGTGTTCT TGCATGTTCA AGTCCTGGAG AGTCTCGAAC ATGTGTCGAT
GCTGTAGATG ATGAAGTTAG TGATTCTAAC AGTTACGAAG TTATTAGCCG TGCTGATTTA
AAAATGAATG GTGGTTCCAT AACAACTAAT GGCATTAATA GCTATGGTGC TTATGCTAAT
GGGAAAAAAG CATATATTAA TTTAGATTAT GTGGCACTTG AAACTGTGGC TGATGGAAGT
TATGCAGTTG CTATTCGACA AGGTAACATT GATATAAAAA ATAGTTCTAT TACAACAACA
GGCACTAAAG CCCCCATTGC AAAAATATAC AATGGTGGAG AGTTATTTTT TTCCAATGTC
ACCGCGGTAT CAAAACAAGA TAAAGGAATA TCAATTGATG CATCAAATAT CGATTCTCAA
GCCAAAATAG CACTATTAAG TGTTGAACTT TCAAGTGCTT TGGATAGTAT TGATGTTAAC
AAAACTACAA CGGATGTAAG TATCCTTAAT CGAAGTATTA TCACACCTGG TAATAATGTT
CTGGTTAATA ATACTGGAGG TGACTTAAAC ATAATTTCGT CCGACTCTAT TCTAAATGGA
GCGACTAAAC TCGTCAGCGG CACAACCACG CTGAAGCTTT CAGAAAATAC AATCTGGAAT
ATGAAAGATG ACTCCGTTGT TACCCATCTG ACTAATTCAG ACAGTATTAT CAATCTTTCG
TATGATGATG GTCAAACATT TACCCAAGGA AAAACATTAA CCGTAAAAGG TAATTATGTC
GGTAATAATG GTCAGCTTAA TATCCGCACC GTATTAGGTG ATGATAAATC GGCTACGGAC
AGACTTATTG TTGAGGGTAA TACTTCGGGT TCAACTACCG TCTATGTGAA AAATGCTGGA
GGAAGCGGCG CGGCCACGCT AAACGGGATC GAACTCATAA CTGTGAATGG CGATGAATCT
CCAGCAGATG CCTTCAGATA A
 
Protein sequence
MKLKKLSGFS LGLIALAVGN AYATQLLDDY SIISYMTDEE SPIEIKDNNP ISNGEYLTTE 
DESHAVKVDD GVTGYINNAS VMTSGDGSYG ISVDSQNKVL YISDSDIKTS GSVSDKENGG
ITASAVVSEF GGTIFMNGDN SVESGGAYSA GLLSQVNDSE KMVNNTRLET TDKTNIVTSG
ENAVGVLACS SPGESRTCVD AVDDEVSDSN SYEVISRADL KMNGGSITTN GINSYGAYAN
GKKAYINLDY VALETVADGS YAVAIRQGNI DIKNSSITTT GTKAPIAKIY NGGELFFSNV
TAVSKQDKGI SIDASNIDSQ AKIALLSVEL SSALDSIDVN KTTTDVSILN RSIITPGNNV
LVNNTGGDLN IISSDSILNG ATKLVSGTTT LKLSENTIWN MKDDSVVTHL TNSDSIINLS
YDDGQTFTQG KTLTVKGNYV GNNGQLNIRT VLGDDKSATD RLIVEGNTSG STTVYVKNAG
GSGAATLNGI ELITVNGDES PADAFR