Gene EcHS_A0407 details


Gene Information

Locus tag: EcHS_A0407
Symbol: lacY
ID: 5594579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 426432
End bp: 427685
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 46%
IMG OID: 640919592
Product: galactoside permease
Protein accession: YP_001457177
Protein GI: 157159859
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID: [TIGR00882] oligosaccharide:H+ symporter


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.00583816
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACTATT TAAAAAACAC AAACTTTTGG ATGTTCGGTT TATTCTTTTT CTTTTACTTT 
TTTATCATGG GAGCCTACTT CCCGTTTTTC CCGATTTGGC TACATGACAT CAACCATATC
AGCAAAAGTG ATACGGGTAT TATTTTTGCT GCTATTTCTC TGTTCTCGCT ATTATTCCAA
CCGCTGTTTG GTCTGCTTTC TGACAAACTC GGGCTGCGCA AATACCTGCT GTGGATTATT
ACCGGCATGT TAGTGATGTT TGCGCCGTTC TTTATTTTTA TCTTCGGGCC ACTGTTACAA
TACAACATTT TAGTAGGATC GATTGTTGGT GGTATTTATC TAGGCTTTTG TTTTAACGCC
GGTGCGCCCG CAGTAGAGGC ATTTATCGAG AAAGTCAGCC GTCGCAGTAA TTTCGAATTT
GGTCGCGCGC GGATGTTTGG CTGTGTTGGC TGGGCGCTGT GTGCCTCGAT TGTCGGCATC
ATGTTCACCA TCAATAATCA GTTCGTTTTC TGGCTGGGTT CTGGCTGTGC ACTCATCCTC
GCCATTTTAC TCTTTTTCGC CAAAACGGAT GCGCCCTCTT CCGCCACGGT TGCCAATGCG
GTAGGTGCCA ACCATTCGGC ATTTAGCCTT AAACTGGCGC TGGAACTGTT CAGACAGCCA
AAACTGTGGT TTTTGTCACT GTATGTTATT GGCGTTTCCT GCACCTACGA TGTTTTTGAC
CAACAGTTTG CTAATTTCTT TACTTCTTTC TTTGCCACCG GTGAACAGGG TACGCGGGTA
TTTGGCTACG TAACGACAAT GGGCGAATTA CTTAACGCCT CGATTATGTT CTTTGCGCCA
CTGATCATTA ATCGCATCGG TGGGAAAAAC GCCCTGCTGC TGGCTGGCAC TATTATGTCT
GTACGTATTA TTGGCTCATC GTTCGCCACC TCAGCGCTGG AAGTGGTTAT TCTGAAAACG
CTGCATATGT TTGAAGTACC GTTCCTGCTG GTGGGCTGCT TTAAATATAT TACCAGCCAG
TTTGAAGTGC GTTTTTCAGC GACGATTTAT CTGGTCTGTT TCTGCTTCTT TAAGCAACTG
GCGATGATTT TTATGTCTGT ACTGGCGGGC AATATGTATG AAAGCATCGG TTTCCAGGGC
GCTTATCTGG TGCTGGGTCT GGTGGCGCTG GGCTTCACCT TAATTTCCGT GTTCACGCTT
AGCGGCCCCG GCCCGCTTTC CCTGCTGCGT CGTCAGGTGA ATGAAGTCGC TTAA
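
The record lists a gene length of 1254 bp and a GC content of 46%. Figures like these could be recomputed from the blocked sequence above with a minimal sketch along the following lines. The function names clean_sequence and gc_fraction are hypothetical helpers introduced for illustration, and the input shown is a toy string, not the lacY sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Strip the spaces and line breaks used for the 10-base display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input for illustration only; paste the full gene sequence here instead.
toy = clean_sequence("ATGC GGCC\nAT")
print(len(toy), round(100 * gc_fraction(toy)))  # length in bp, GC content in %
```

Applied to the full gene sequence, len() would give the length in bp and the rounded percentage could be compared against the GC content field.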
 
Protein sequence
MYYLKNTNFW MFGLFFFFYF FIMGAYFPFF PIWLHDINHI SKSDTGIIFA AISLFSLLFQ 
PLFGLLSDKL GLRKYLLWII TGMLVMFAPF FIFIFGPLLQ YNILVGSIVG GIYLGFCFNA
GAPAVEAFIE KVSRRSNFEF GRARMFGCVG WALCASIVGI MFTINNQFVF WLGSGCALIL
AILLFFAKTD APSSATVANA VGANHSAFSL KLALELFRQP KLWFLSLYVI GVSCTYDVFD
QQFANFFTSF FATGEQGTRV FGYVTTMGEL LNASIMFFAP LIINRIGGKN ALLLAGTIMS
VRIIGSSFAT SALEVVILKT LHMFEVPFLL VGCFKYITSQ FEVRFSATIY LVCFCFFKQL
AMIFMSVLAG NMYESIGFQG AYLVLGLVAL GFTLISVFTL SGPGPLSLLR RQVNEVA
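
The gene and protein lengths in the record are consistent with each other: 417 codons plus one stop codon gives 3 x 418 = 1254 bp. The relationship between the two sequences can be sketched as below, assuming the amino acid assignments of translation table 11, which match the standard genetic code. The sketch ignores the alternative start codons that table 11 permits (the first codon here is ATG), and the translate helper is a hypothetical name, not part of any tool used to produce this record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first in-frame stop codon.

    Whitespace from the blocked display format is removed first. Codons with
    characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGTACTATT TAAAAAACAC AAACTTTTGG"
print(translate(first_block))  # MYYLKNTNFW

# Length check using the values from the record: 417 aa plus one stop codon.
print(3 * (417 + 1) == 1254)  # True
```

The ten residues printed match the start of the protein sequence above.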