Gene EcE24377A_1977 details

Gene Information

Locus tag: EcE24377A_1977
Symbol:
ID: 5587655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 1961695
End bp: 1962861
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 56%
IMG OID: 640925649
Product: putative ABC transporter solute-binding protein
Protein accession: YP_001463052
Protein GI: 157155445
COG category: [R] General function prediction only
COG ID: [COG4134] ABC-type uncharacterized transport system, periplasmic component
TIGRFAM ID:
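
The start, end, gene length and protein length fields above are linked by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive (an assumption, although it is consistent with the listed values) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1961695, 1962861)
print(length, protein_length(length))  # prints: 1167 388
```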


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.942474
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCATT GTGGGTGGTT GCTGGGATTG TTATCGCTGT TTTCTTTGGC AACACATGCC 
AGTGACTGGC AGGAAATTAA AAATGAGGCC AAAGGGCAAA CCGTCTGGTT TAACGCCTGG
GGCGGCGATA CCGCAATTAA CCGCTATCTC GACTGGGTTA GCGGCGAGAT GAAAACCCAT
TACGCTATAA ACCTGAAGAT TGTTCGTCTC GCGGATGCCG CAGACGCGGT GAAGCGCATT
CAGACCGAAG CTGCTGCCGG ACGTAAAACG GGCGGCTCGG TGGATCTGCT CTGGGTGAAC
GGCGAAAACT TCCGCACCTT AAAAGAGGCC AATTTATTAC AAACGGGCTG GGCGGAGACT
CTGCCCAACT GGCGCTATGT CGACACACAG CTGCCGGTGC GGGAAGATTT TTCAGTGCCG
ACACAAGGTG CGGAATCGCC CTGGGGCGGC GCACAACTGA CGTTTATCGC CCGCCGCGAT
GTTACGCCAC AGCCACCACA AACGCCGCAA GCCTTACTGG AGTTTGCTAA AGTCAATCCC
GGCACGGTTA CCTATCCGCG CCCACCGGAC TTTACCGGCA CGGCGTTTCT TGAACAGTTG
CTGATTATGC TGACGCCAGA TCCCGCCGCA TTAAAAGAAG CGCCGGACGA TGCGACTTTC
GCCCGTGTCA CTGCTCCCTT GTGGCAATAT CTTGATGCGC TACATCCGTA TTTGTGGCGC
GAAGGAAAGG ATTTCCCGCC TTCGCCCGCG CGGATGGATG CTCTGCTGAA AGCCGGCACA
TTGCGCCTGT CGCTGACCTT TAGCCCCGCG CATGCGCAGC AAAAAATCGC CAGCGGCGAT
TTGCCGGCAA GCAGTTACAG TTTTGGCTTT CGCGAGGGGA TGATAGGCAA CGTGCATTTC
GTCACCATTC CAGCCAACGC GAATGCCAGT GCTGCGGCGA AGGTAGTTGC CAATTTCCTG
CTCTCACCCG ATGCGCAACT GCGTAAAGCA GATCCCGTTG TCTGGGGCGA TCCTTCTGTT
CTCGATCCGC AAAAACTGCC TGACGGGCAG CGCGAAATAT TGCAATCAAG AATGCCGCAG
GATCTGCCGC CGGTACTGGC TGAACCGCAC GCAGGATGGG TGAATGCACT GGAACAAGAA
TGGTTACGCC GTTACGGTAC GCATTAA
 
Protein sequence
MRHCGWLLGL LSLFSLATHA SDWQEIKNEA KGQTVWFNAW GGDTAINRYL DWVSGEMKTH 
YAINLKIVRL ADAADAVKRI QTEAAAGRKT GGSVDLLWVN GENFRTLKEA NLLQTGWAET
LPNWRYVDTQ LPVREDFSVP TQGAESPWGG AQLTFIARRD VTPQPPQTPQ ALLEFAKVNP
GTVTYPRPPD FTGTAFLEQL LIMLTPDPAA LKEAPDDATF ARVTAPLWQY LDALHPYLWR
EGKDFPPSPA RMDALLKAGT LRLSLTFSPA HAQQKIASGD LPASSYSFGF REGMIGNVHF
VTIPANANAS AAAKVVANFL LSPDAQLRKA DPVVWGDPSV LDPQKLPDGQ REILQSRMPQ
DLPPVLAEPH AGWVNALEQE WLRRYGTH
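
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to rejoin such blocks, translate the DNA, and compute GC content. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch does not model). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon (1 or 2 bases) is ignored.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCGCCATT GTGGGTGGTT")
print(translate(fragment))  # prints: MRHCGW
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the listed protein sequence, and `gc_percent` can be compared against the reported GC content.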