Gene EcHS_A2815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2815 
SymbolproX 
ID5595490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2824575 
End bp2825567 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content53% 
IMG OID640921931 
Productglycine betaine transporter periplasmic subunit 
Protein accessionYP_001459448 
Protein GI157162130 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones56 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACATA GCGTACTTTT TGCGACAGCG TTTGCCACGC TTATCTCTAC ACAAACTTTT 
GCTGCCGATC TGCCGGGCAA AGGCATTACT GTTAATCCAG TTCAGAGCAC CATCACTGAA
GAAACCTTCC AGACGCTGCT GGTCAGTCGT GCGCTGGAGA AATTAGGTTA TACCGTCAAC
AAACCCAGCG AAGTAGATTA CAACGTTGGC TACACCTCGC TTGCTTCCGG CGATGCAACC
TTCACCGCCG TGAACTGGAC GCCACTGCAT GACAACATGT ACGAAGCTGC CGGTGGCGAT
AAGAAATTTT ATCGTGAAGG GGTATTTGTT AACGGCGCGG CACAGGGTTA CCTGATCGAT
AAGAAAACCG CCGACCAGTA CAAAATCACC AACATCGCAC AACTGAAAGA TCCGAAGATC
GCCAAACTGT TCGATACCAA CGGCGACGGA AAAGCGGATT TAACCGGTTG TAACCCTGGC
TGGGGCTGCG AAGGTGCGAT CAACCACCAG CTTGCCGCGT ATGAACTGAC CCATACCGTG
ACGCATAATC AGGGGAACTA CGCGGCGATG ATGGCCGACA CCATCAGTCG CTACAAAGAG
GGCAAACCGG TGTTTTACTA CACCTGGACG CCGTACTGGG TGAGTAATGA GCTGAAGCCA
GGGAAAGATG TGGTCTGGTT GCAGGTGCCG TTCTCCGCAC TGCCGGGCGA TAAAAACGCC
GATACCAAAC TGCCGAATGG TGCGAATTAT GGCTTCCCGG TCAGCACCAT GCATATCGTT
GCCAACAAAG CCTGGGCCGA GAAAAACCCG GCAGCAGCGA AACTGTTTGC CATTATGCAG
TTGCCAGTGG CAGATATTAA CGCCCAGAAC GCCATTATGC ATGACGGCAA AGCCTCAGAA
GGCGATATTC AGGGCCATGT TGATGGCTGG ATCAAAGCCC ACCAGCAGCA GTTCGATGGC
TGGGTGAATG AGGCGCTGGC AGCGCAGAAG TAA
 
Protein sequence
MRHSVLFATA FATLISTQTF AADLPGKGIT VNPVQSTITE ETFQTLLVSR ALEKLGYTVN 
KPSEVDYNVG YTSLASGDAT FTAVNWTPLH DNMYEAAGGD KKFYREGVFV NGAAQGYLID
KKTADQYKIT NIAQLKDPKI AKLFDTNGDG KADLTGCNPG WGCEGAINHQ LAAYELTHTV
THNQGNYAAM MADTISRYKE GKPVFYYTWT PYWVSNELKP GKDVVWLQVP FSALPGDKNA
DTKLPNGANY GFPVSTMHIV ANKAWAEKNP AAAKLFAIMQ LPVADINAQN AIMHDGKASE
GDIQGHVDGW IKAHQQQFDG WVNEALAAQK