Gene EcolC_3665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3665 
Symbol 
ID6065370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4013375 
End bp4015042 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content55% 
IMG OID641603080 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_001726603 
Protein GI170021649 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTCAAT TCGTTTATAC CATGCATCGT GTCGGCAAAG TTGTTCCGCC GAAACGTCAT 
ATTTTGAAAA ACATCTCTCT GAGTTTCTTC CCTGGGGCAA AAATTGGTGT CCTGGGTCTG
AACGGCGCGG GTAAGTCTAC CCTGCTGCGC ATTATGGCGG GCATTGATAA AGACATCGAA
GGTGAAGCGC GTCCGCAGCC AGACATCAAG ATTGGTTACC TGCCGCAGGA ACCGCAGCTG
AACCCGGAAC ACACCGTGCG TGAGTCCATT GAAGAAGCGG TTTCTGAAGT GGTTAACGCC
CTGAAACGCC TGGATGAAGT GTATGCGCTG TACGCCGATC CGGATGCCGA TTTTGACAAG
CTGGCCGCTG AACAAGGCCG TCTGGAAGAG ATCATTCAGG CTCACGACGG TCATAACCTG
AACGTACAGC TGGAGCGTGC GGCGGATGCG CTACGTCTGC CGGACTGGGA CGCGAAAATC
GCTAACCTCT CCGGTGGTGA GCGTCGTCGC GTAGCGTTGT GCCGCCTGCT GCTGGAAAAA
CCAGACATGC TGCTGCTCGA CGAACCGACC AACCACCTGG ATGCCGAATC CGTGGCCTGG
CTGGAACGCT TCCTGCACGA CTTCGAGGGC ACCGTGGTGG CGATTACCCA CGACCGTTAC
TTCCTCGATA ACGTTGCAGG CTGGATCCTC GAACTTGACC GCGGTGAAGG TATTCCGTGG
GAAGGCAACT ACTCCTCCTG GCTGGAGCAG AAAGATCAGC GCCTGGCGCA GGAAGCTTCA
CAAGAAGCGG CGCGTCGTAA GTCGATCGAG AAAGAGCTGG AGTGGGTACG TCAGGGAACT
AAAGGCCGCC AGTCGAAAGG TAAAGCACGT CTGGCACGCT TTGAAGAGCT GAACAGCACC
GAATATCAGA AACGTAACGA AACCAACGAA CTGTTTATTC CACCTGGACC GCGTCTGGGC
GATAAAGTGC TGGAAGTCAG CAACCTGCGT AAATCCTACG GTGATCGCCT GCTGATTGAT
GACCTGAGCT TCTCGATCCC GAAAGGGGCA ATCGTCGGGA TCATCGGTCC GAACGGCGCG
GGTAAATCGA CCCTGTTCCG TATGATCTCT GGTCAGGAAC AGCCGGACAG CGGCACCATC
ACTTTAGGTG AAACGGTGAA ACTGGCATCG GTTGATCAGT TCCGTGACTC AATGGATAAC
AGCAAAACCG TTTGGGAAGA AGTTTCCGGC GGGCTGGATA TTATGAAGAT CGGCAACACC
GAGATGCCAA GCCGCGCCTA CGTTGGCCGC TTTAACTTTA AAGGGGTTGA TCAGGGTAAA
CGCGTTGGTG AACTTTCCGG TGGTGAGCGC GGTCGTCTGC ATCTGGCGAA GCTGCTGCAG
GTTGGCGGCA ACATGCTGCT GCTCGACGAA CCAACCAACG ACCTGGATAT CGAAACCCTG
CGCGCGCTGG AAAACGCCCT GCTGGAGTTC CCGGGCTGTG CGATGGTTAT CTCGCACGAC
CGTTGGTTCC TCGACCGTAT CGCCACGCAC ATCCTGGACT ACCAGGATGA AGGTAAAGTT
GAGTTCTTCG AAGGTAACTT TACTGAGTAC GAAGAGTACA AGAAACGCAC GCTGGGCGCA
GACGCACTGG AGCCGAAGCG TATCAAGTAC AAGCGTATTG CGAAGTAA
 
Protein sequence
MAQFVYTMHR VGKVVPPKRH ILKNISLSFF PGAKIGVLGL NGAGKSTLLR IMAGIDKDIE 
GEARPQPDIK IGYLPQEPQL NPEHTVRESI EEAVSEVVNA LKRLDEVYAL YADPDADFDK
LAAEQGRLEE IIQAHDGHNL NVQLERAADA LRLPDWDAKI ANLSGGERRR VALCRLLLEK
PDMLLLDEPT NHLDAESVAW LERFLHDFEG TVVAITHDRY FLDNVAGWIL ELDRGEGIPW
EGNYSSWLEQ KDQRLAQEAS QEAARRKSIE KELEWVRQGT KGRQSKGKAR LARFEELNST
EYQKRNETNE LFIPPGPRLG DKVLEVSNLR KSYGDRLLID DLSFSIPKGA IVGIIGPNGA
GKSTLFRMIS GQEQPDSGTI TLGETVKLAS VDQFRDSMDN SKTVWEEVSG GLDIMKIGNT
EMPSRAYVGR FNFKGVDQGK RVGELSGGER GRLHLAKLLQ VGGNMLLLDE PTNDLDIETL
RALENALLEF PGCAMVISHD RWFLDRIATH ILDYQDEGKV EFFEGNFTEY EEYKKRTLGA
DALEPKRIKY KRIAK