Gene EcHS_A1400 details

Gene Information

Locus tag: EcHS_A1400
Symbol:
ID: 5592703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1396540
End bp: 1397745
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 50%
IMG OID: 640920555
Product: hypothetical protein
Protein accession: YP_001458114
Protein GI: 157160796
COG category: [S] Function unknown
COG ID: [COG4950] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
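
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1396540, 1397745)
print(length, "bp")                      # gene length in base pairs
print(protein_length_aa(length), "aa")   # protein length in residues
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.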


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.0205416
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTATCGC CGATCCGTCT TTCTCCCCTT CCCGCCTTGC GTCAGGATAA CGATTTCCTT 
TACGACCAAG GAGCGCCCAT GGAACAACGC CACATCACCG GCAAAAGCCA CTGGTATCAT
GAAACGCAAT CCAGTACTGC GGAGTATGAC GTTCTGCCTC TGGTCCCGGA AGCCGCAAAG
GTCAGCGATC CCTTTCTGCT CGACGTGATC CTTGATGAAG AAACGCTGGC CCCCTTCCTT
TCATGGCTGG TCCCTGCGCG CGTTCTTGCA GTGGAATTGT TCCCTGACCA GCTTACCGTG
ACCCGTTCAC AGACTTTCAC CGCTTATGAA CGCTTGTCTA CGGCCCTGAC GGTTGCTCAG
GTTTGCGGCG TCCAGCGGTT ATGTAACTAC TATTCGGCGC GACTTACGCC GCTCCCCGGG
CCTGATTCCT CCAGGGAAAG TAATCATCGG TTGGCACAAA TCACGCAATA TGCCCGCCAA
CTGGTTAGCT CGCCTTCTAT TATCGACAAC CGATCGCGCC AGCATCTGAA TGACGTCGGT
CTTACTGCCT GGGACTGTGT AATCATTAAC CAAATCATTG GATTTATTGG CTTTCAGGCG
CGGACCATTG CGACATTTCA GGCTTATCTT GGACATCCGG TACGCTGGTT ACCCGGTCTG
GAGATACAAA ACTATGCCGA CGCGTCACTG TTTACTGATG AATCAATACG CTGGCGAAGC
AGCTATGAGG TGGAAAAACT ACCTGAAGAT TACACAAAAA GTTCAACTGC AGAACTTTGC
CAACTGGCTG AAACACTCTC TCTCCACCCT ATTTCACTTT CCCTTCTTGA AAAGTTGTTA
AACAGCACAC GGGTTAATAC ACAGCCGGAT AATCAGCTTG CGGCGTTGTT ATGCGCACGG
ATAAATGGCA GCCCTGCTTG TTTTGCCGCC TGTATGGATT CAGTAAATGA ATATAAAAAA
ATCAACACCC TTCTGCGCAA GGGCGAAAAT GAAATTAACC AATGGGCTGA CCGTCATTCT
GTTGAGCACG CTACCGTTCA GGCGATACAA TGGCTGACCC GAGCACCCGA TCGCTTTAGC
GCCGCCCAGT TCAGCCCTTT ACTCGAACAC GAAAAATCAT CAACGCAGAT TATTAATCTG
CTGGTATGGA GCGGGCTGTG TGGCTGGATA AATCGCTTAA AAATCGCGTT GGGTGAGACA
TATTAA
 
Protein sequence
MLSPIRLSPL PALRQDNDFL YDQGAPMEQR HITGKSHWYH ETQSSTAEYD VLPLVPEAAK 
VSDPFLLDVI LDEETLAPFL SWLVPARVLA VELFPDQLTV TRSQTFTAYE RLSTALTVAQ
VCGVQRLCNY YSARLTPLPG PDSSRESNHR LAQITQYARQ LVSSPSIIDN RSRQHLNDVG
LTAWDCVIIN QIIGFIGFQA RTIATFQAYL GHPVRWLPGL EIQNYADASL FTDESIRWRS
SYEVEKLPED YTKSSTAELC QLAETLSLHP ISLSLLEKLL NSTRVNTQPD NQLAALLCAR
INGSPACFAA CMDSVNEYKK INTLLRKGEN EINQWADRHS VEHATVQAIQ WLTRAPDRFS
AAQFSPLLEH EKSSTQIINL LVWSGLCGWI NRLKIALGET Y
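
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. It assumes translation table 11 (as listed in the record), whose amino-acid assignments match the standard code, and it reads an alternative initiator codon such as the leading TTG as methionine. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical helpers written for this illustration, not part of any database tool.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons of translation table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGTTATCGC CGATCCGTCT TTCTCCCCTT"))
```

Passing the full gene sequence, spaces and line breaks included, gives a string that can be compared with the listed protein sequence once its own whitespace is removed.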