Gene EcHS_A2029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2029 
Symbol 
ID5593822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2027281 
End bp2028486 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content55% 
IMG OID640921173 
Productputative inner membrane protein 
Protein accessionYP_001458718 
Protein GI157161400 
COG category[R] General function prediction only 
COG ID[COG2391] Predicted transporter component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATGGC AGCAATTCAA ACACGCCTGG TTGATTAAAT TCTGGGCGCC CATCCCCGCG 
GTCATCGCGG CGGGTATTCT CTCCACTTAC TATTTTGGCA TCACTGGCAC CTTTTGGGCT
GTCACAGGTG AATTTACCCG TTGGGGCGGT CAGCTCCTGC AGTTGTTCGG CGTCCATGCT
GAAGAGTGGG GTTATTTTAA AATTATCCAT CTGGAAGGAT CGCCATTAAC CCGCATCGAC
GGAATGATGA TCCTCGGTAT GTTTGGCGGC TGTTTTGCCG CAGCGCTGTG GGCCAACAAT
GTCAAACTGC GCATGCCGCG CAGCCGTATC CGCATTATGC AGGCCATCAT TGGCGGTATT
ATCGCCGGTT TTGGCGCACG TCTGGCAATG GGCTGTAACC TGGCGGCGTT CTTTACCGGA
ATTCCACAGT TCTCGCTGCA CGCCTGGTTC TTTGCCATCG CCACTGCCAT TGGTTCATGG
TTTGGCGCGC GCTTTACCCT TCTGCCCATC TTCCGTATTC CCGTGAAAAT GCAGAAAGTT
TCTGCTGCCT CACCGTTGAC GCAAAAACCG GATCAGGCGC GGCGTCGTTT TCGTCTCGGG
ATGCTGGTCT TTTTCGGCCT GCTGGGCTGG GCGCTGCTGA CGGCGATGAA CCAGCCCAAA
CTGGGGCTGG CAATGCTGTT TGGCGTCGGC TTTGGTTTAC TGATTGAACG TGCGCAAATC
TGCTTTACTT CGGCGTTCCG CGACATGTGG ATCACCGGAC GTACCCATAT GGCGAAAGCA
ATCATTATCG GTATGGCGGT AAGTGCCATC GGGATCTTCA GTTACGTACA GTTAGGCGTT
GAACCCAAAA TCATGTGGGC GGGACCAAAC GCGGTAATTG GTGGTTTACT GTTTGGTTTT
GGCATCGTGC TGGCAGGCGG CTGCGAAACC GGCTGGATGT ACCGCGCGGT AGAAGGCCAG
GTGCACTACT GGTGGGTCGG TCTGGGCAAC GTGATCGGCT CAACGATTCT GGCGTACTAC
TGGGATGATT TCGCTCCGGC GCTGGCCACC GACTGGGACA AAATCAACCT GCTGAAAACC
TTTGGCCCGA TGGGCGGCCT GCTGGTGACA TATTTGCTGT TGTTTGCTGC TCTAATGTTG
ATTATTGGCT GGGAAAAACG CTTCTTCCGC CGTGCGGCAC CGCAGACTGC TAAGGAGATC
GCATGA
 
Protein sequence
MSWQQFKHAW LIKFWAPIPA VIAAGILSTY YFGITGTFWA VTGEFTRWGG QLLQLFGVHA 
EEWGYFKIIH LEGSPLTRID GMMILGMFGG CFAAALWANN VKLRMPRSRI RIMQAIIGGI
IAGFGARLAM GCNLAAFFTG IPQFSLHAWF FAIATAIGSW FGARFTLLPI FRIPVKMQKV
SAASPLTQKP DQARRRFRLG MLVFFGLLGW ALLTAMNQPK LGLAMLFGVG FGLLIERAQI
CFTSAFRDMW ITGRTHMAKA IIIGMAVSAI GIFSYVQLGV EPKIMWAGPN AVIGGLLFGF
GIVLAGGCET GWMYRAVEGQ VHYWWVGLGN VIGSTILAYY WDDFAPALAT DWDKINLLKT
FGPMGGLLVT YLLLFAALML IIGWEKRFFR RAAPQTAKEI A