Gene EcHS_A0231 details


Gene Information

Locus tag: EcHS_A0231
Symbol:
ID: 5591121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 250417
End bp: 251748
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 55%
IMG OID: 640919418
Product: hypothetical protein
Protein accession: YP_001457005
Protein GI: 157159687
COG category: [S] Function unknown
COG ID: [COG3522] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03353] type VI secretion protein, VC_A0114 family
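
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 250417
end_bp = 251748
gene_length_bp = 1332
protein_length_aa = 443

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which is assumed not to be counted in the protein length.
codon_count = gene_length_bp // 3
coding_codons = codon_count - 1

print(span_bp)        # 1332
print(codon_count)    # 444
print(coding_codons)  # 443
```

Under these assumptions the span matches the listed gene length, and 444 codons minus one stop codon matches the listed protein length of 443 aa.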


Plasmid Coverage information

Num covering plasmid clones: 70
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACCA CCCGCAACAA GGTGATGTGG CAGGAAGGGA TGCTGATGCG TCCACACCAT 
TTCCAGCAGC AGCAGCGTTA CAACGACTAC CTGGATAACC AGCGTTTCCG GGCCATGAAT
AATTTATCCT GGGGATTTAC CGAACTCACT CTCAACAATG AACTGCTGGC ACAGGGTAAG
ATCATGATTG ACAGCGCGTC AGGCACACTG CCCGACGGCA CCGTCTTTTC TATCCCCGAC
CAGGACGCAC TGCCCGATCC GCTGCACCCG CAACATTTTC CGGACGAGAG AAGCCGCAAT
ATCTACCTCG CTCTGCCGGT CGCCAGTGAT GTGAGAAATG AAATCAGCGA CGGGCGGCGA
ATCGGGCGTT ACCGGCTGAA TTATGCCGAT GTCCGGGATT TGCATTCAGA AGAAGGTGAC
ACGCGAACAC TGACGCTGGG ACAACTGACG CCGCGCATTA TGAGCGGTGC AGAAGATATG
AGCGCCTATA TTACGCTCCC GCTTTGCCGT ATCAGTGATC GCCATGCCGA CGGCTCCCTG
ACGCTGGATG ACGATTTTAT CCCCTCCTGC CAGAATATTC AGGTCAGTAA GAAACTGCGT
GTTTATCTCA AAGAGGTACA GGGGGCCATT GGCGGACGGG CAAGCGATCT GGCAAACCGC
ATTGGCTCTC CGGCGCAGAG CGGCATCGCG GATGTGGCGG AATTTATGAT GTTGCAGTTA
CTTAACCGTA ACCAGACCCG GTTTACCCAT CGCGCTCGTC GATCCCAGCT CCACCCGGAA
GATTTCTACC TTGATCTTGC CGGGTTGCTG GGTGAACTGA TGACCTTTAC CGAGCCGTCG
CGCCTGCCCT GCCCGCTTGA TGTGTATGAT CATCATGACC TGACCAAAAC ATTTAAAACG
CTGTTACCGG AAGTCAAACG GGCGCTGCAT ACCGTACTGT CGCCAAGAGC CGTTAATCTG
CCGCTGCATC TGCGTGACGG TATCTGGCAG GCCGATATCC ATGACACAGA ACTGCTGCAA
TCTGCCACCT TTGTGCTGGC TGTGGCGGCA AACATGCCGG TCGATCAGAT CCAGCGCCAG
TTTATCCAGC AGTCGAAAAT TTCCTCGCCG GAAAAAATCC GCAATATGGT CAGTGTGCAG
ATACCCGGCA TTCCATTGCG TGCCCTGATG GTGGCTCCCC GCCAGCTTCC TTACCATTCC
GGGTTCAGCT ATTTCGAACT CGACAAGAGC GGACAGGCCT GGACAGAAAT GGCTGCCGCC
GGGGCCGTTG CACTGCATGT ATCCGGCAGT TTCCCGGATC TGAACATGCA ACTGTGGGCG
ATAAGAGGGT AA
 
Protein sequence
MATTRNKVMW QEGMLMRPHH FQQQQRYNDY LDNQRFRAMN NLSWGFTELT LNNELLAQGK 
IMIDSASGTL PDGTVFSIPD QDALPDPLHP QHFPDERSRN IYLALPVASD VRNEISDGRR
IGRYRLNYAD VRDLHSEEGD TRTLTLGQLT PRIMSGAEDM SAYITLPLCR ISDRHADGSL
TLDDDFIPSC QNIQVSKKLR VYLKEVQGAI GGRASDLANR IGSPAQSGIA DVAEFMMLQL
LNRNQTRFTH RARRSQLHPE DFYLDLAGLL GELMTFTEPS RLPCPLDVYD HHDLTKTFKT
LLPEVKRALH TVLSPRAVNL PLHLRDGIWQ ADIHDTELLQ SATFVLAVAA NMPVDQIQRQ
FIQQSKISSP EKIRNMVSVQ IPGIPLRALM VAPRQLPYHS GFSYFELDKS GQAWTEMAAA
GAVALHVSGS FPDLNMQLWA IRG
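
The relationship between the gene sequence and the protein sequence can be sketched in code. This is a minimal illustration rather than the database's own method: the helper names `clean`, `translate`, and `gc_content` are hypothetical, and the codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares for codon-to-amino-acid assignments (the two tables differ only in permitted start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGGCAACCA CCCGCAACAA GGTGATGTGG"))  # MATTRNKVMW
print(translate("ATAAGAGGGT AA"))                     # IRG
```

Passing the full gene sequence to `translate` and `gc_content` would allow the listed protein sequence and GC content to be compared against values derived from the DNA.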