Gene EcHS_A2265 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2265 
Symbol 
ID5593676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2259757 
End bp2260914 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content58% 
IMG OID640921394 
Productquaternary amine ABC transporter permease 
Protein accessionYP_001458930 
Protein GI157161612 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones69 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTTATT TCCGTATTAA TCCTGTTCTG GCGCTGCTGC TGTTGCTGAC GGCAATCGCA 
GCGGCGCTGC CGTTTATCAG TTACGCGCCT AATCGTTTAG TTTCGGGTGA GGGGCGTCAT
CTCTGGCAGC TGTGGCCGCA AACGATCTGG ATGCTGGTGG GCGTTGGTTG CGCCTGGCTG
ACGGCCTGTT TTATTCCCGG TAAAAAAGGC AGCATTTGTG CACTCATTCT GGCGCAATTC
GTCTTCGTAT TGCTGGTGTG GGGAGCTGGA AAGGCGGCGA CCCAACTGGC GCAAAATGGC
AGTGCGCTGG CGCGTACCAG CCTCGGCAGT GGTTTCTGGC TGGCTGCGGC GCTGGCATTG
CTGGCCTGTA GCGATGCCAT CCGCCGAATC TCCACGCATC CGCTGTGGCG CTGGTTGTTG
CATATGCAGA TTGCCATTAT TCCGCTGTGG TTGCTGTACT CCGGCACGCT TAACGATCTC
TCACTAATGA AAGAATACGC CAACCGTCAG GATGTGTTTG ACGACGCGCT GGCACAACAT
CTGACGTTGC TGTTTGGTGC GGTGCTGCCT GCGTTAGTGA TTGGTGTGCC GTTGGGCATC
TGGTGCTACT TTTCCACTGC TCGGCAGGGG GCAATTTTTT CTCTGCTCAA TGTCATTCAG
ACCGTGCCTT CGGTGGCGCT CTTTGGCCTG TTGATTGCGC CGCTTGCCGC GCTGGTGACG
GCCTTTCCGT GGCTGGGGAA GCTCGGCATA GCAGGAACCG GAATGACACC CGCACTGATT
GCGCTGGTGC TCTATGCCTT GCTGCCGCTG GTGCGCGGCG TGGTAGTCGG CTTGAACCAG
ATCCCGCGCG ATGTGCTGGA GAGCGCCAGA GCGATGGGCA TGAGCGGGGC GCGGCGATTC
CTGCATGTTC AGTTACCACT GGCGTTACCG GTATTTCTGC GCAGCCTGCG GGTGGTGATG
GTGCAAACTG TAGGTATGGC GGTGATTGCG GCGTTAATCG GCGCAGGCGG TTTTGGTGCG
CTGGTTTTCC AGGGGCTGCT AAGCAGCGCC ATTGATTTAG TGTTGCTGGG GGTGATCCCG
GTAATTGTTC TGGCGGTGCT TACCGACGCG CTGTTCGATT TGCTTATCGC ACTGCTGAAG
GTGAAACGTA ATGATTGA
 
Protein sequence
MTYFRINPVL ALLLLLTAIA AALPFISYAP NRLVSGEGRH LWQLWPQTIW MLVGVGCAWL 
TACFIPGKKG SICALILAQF VFVLLVWGAG KAATQLAQNG SALARTSLGS GFWLAAALAL
LACSDAIRRI STHPLWRWLL HMQIAIIPLW LLYSGTLNDL SLMKEYANRQ DVFDDALAQH
LTLLFGAVLP ALVIGVPLGI WCYFSTARQG AIFSLLNVIQ TVPSVALFGL LIAPLAALVT
AFPWLGKLGI AGTGMTPALI ALVLYALLPL VRGVVVGLNQ IPRDVLESAR AMGMSGARRF
LHVQLPLALP VFLRSLRVVM VQTVGMAVIA ALIGAGGFGA LVFQGLLSSA IDLVLLGVIP
VIVLAVLTDA LFDLLIALLK VKRND