Gene EcolC_2647 details

Gene Information

Locus tag: EcolC_2647
Symbol:
ID: 6064666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2898489
End bp: 2900396
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 52%
IMG OID: 641602054
Product: ABC transporter ATPase component
Protein accession: YP_001725604
Protein GI: 170020650
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
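
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes a single terminal stop codon; both are assumptions about how this record was generated, not statements from the record itself. The function name `check_lengths` is hypothetical.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return (span, expected_aa, consistent) for a CDS record.

    Assumes 1-based inclusive coordinates and one stop codon that is
    not counted in the protein length.
    """
    span_bp = end_bp - start_bp + 1      # inclusive span on the replicon
    expected_aa = span_bp // 3 - 1       # codons minus the stop codon
    consistent = (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_aa == protein_length_aa
    )
    return span_bp, expected_aa, consistent


# Values copied from the Gene Information fields above.
span_bp, expected_aa, consistent = check_lengths(2898489, 2900396, 1908, 635)
print(span_bp, expected_aa, consistent)
```

Under these assumptions the span, the gene length, and the protein length agree with one another.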


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0437839
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000617984
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCATTAA TCAGTATGCA TGGCGCATGG CTGTCGTTCA GCGACGCGCC GCTTCTCGAT 
AACGCAGAAC TGCATATCGA AGATAACGAA CGTGTTTGTC TGGTGGGCCG CAACGGCGCA
GGCAAATCGA CGTTAATGAA AATCCTCAAC CGTGAACAAG GGCTGGATGA CGGTCGCATT
ATTTACGAGC AAGATTTGAT TGTAGCGCGT CTGCAACAGG ATCCGCCGCG TAACGTTGAG
GGTAGCGTTT ATGATTTCGT TGCCGAAGGC ATTGAAGAAC AAGCGGAATA TCTGAAACGC
TATCACGATA TTTCGCGCCT GGTGATGAAC GACCCGAGCG AGAAAAATCT CAACGAACTG
GCGAAGGTTC AGGAACAGCT GGATCACCAC AACCTGTGGC AGCTGGAAAA CCGCATCAAC
GAAGTGCTGG CGCAACTGGG GTTAGATCCT AACGTTGCGC TGTCGTCGCT TTCCGGCGGC
TGGTTGCGTA AAGCGGCATT AGGACGCGCG CTGGTGAGTA ATCCGCGCGT GCTGTTGCTT
GATGAACCGA CAAACCACCT GGATATTGAA ACCATCGACT GGCTGGAAGG GTTTTTGAAA
ACTTTCAACG GGACGATTAT TTTCATCTCC CACGACCGTT CGTTTATCCG CAATATGGCG
ACGCGCATTG TTGATCTCGA TCGCGGCAAG CTGGTGACCT ATCCAGGGAA TTACGACCAG
TACCTGCTGG AAAAAGAAGA AGCCCTGCGC GTGGAAGAAT TACAAAATGC CGAGTTCGAT
CGCAAACTGG CGCAGGAAGA GGTGTGGATC CGCCAGGGGA TCAAAGCACG CCGTACCCGT
AATGAAGGCC GCGTACGCGC CCTGAAAGCG ATGCGTCGCG AACGTGGTGA ACGTCGCGAA
GTGATGGGTA CCGCAAAGAT GCAGGTGGAA GAGGCCAGCC GCTCCGGTAA AATCGTTTTC
GAAATGGAAG ACGTTTGCTA CCAGGTTAAC GGTAAGCAAC TGGTGAAAGA TTTTTCTGCC
CAGGTTCTAC GTGGCGACAA AATTGCCCTG ATTGGTCCGA ATGGGTGCGG CAAAACCACG
CTGCTAAAAC TGATGCTCGG TCAGCTTCAA GCGGACAGCG GGCGTATTCA CGTTGGCACC
AAACTGGAAG TGGCTTATTT CGATCAGCAC CGCGCGGAAC TGGATCCCGA TAAAACGGTG
ATGGATAACC TTGCCGAAGG TAAGCAAGAG GTGATGGTTA ACGGCAAGCC ACGCCACGTA
TTGGGCTATT TGCAGGACTT TCTGTTCCAT CCGAAACGGG CGATGACGCC GGTACGTGCG
CTTTCTGGCG GTGAGCGGAA CCGCTTGCTG CTGGCGCGTT TGTTCCTCAA ACCAAGCAAC
TTATTGATTC TTGACGAACC GACCAACGAT CTTGATGTCG AAACGCTGGA ACTGCTGGAA
GAACTGATCG ACAGCTATCA GGGCACGGTA TTGCTGGTTA GCCACGATCG TCAGTTTGTC
GATAACACCG TTACAGAATG TTGGATCTTC GAAGGCGGCG GTAAAATTGG TCGTTATGTC
GGCGGTTATC ATGATGCCCG TGGTCAGCAA GAGCAGTATG TGGCGCTCAA ACAGCCTGCG
GTGAAAAAAA CCGAAGAAGC CGCCGCGGCA AAAGCGGAAA CTGTAAAACG CAGCAGTAGC
AAACTAAGCT ATAAATTGCA GCGCGAACTG GAGCAGCTAC CGCAATTGCT CGAAGATCTG
GAGGCGAAGC TGGAAGCCCT ACAGACGCAA GTGGCGGATG CTTCCTTCTT CAGTCAGCCG
CATGAGCAGA CGCAAAAAGT GCTTGCTGAT ATGGCTGCTG CAGAGCAGGA GCTGGAGCAA
GCCTTTGAAC GCTGGGAGTA TCTTGAAGCG TTAAAAAATG GTGGCTGA
 
Protein sequence
MSLISMHGAW LSFSDAPLLD NAELHIEDNE RVCLVGRNGA GKSTLMKILN REQGLDDGRI 
IYEQDLIVAR LQQDPPRNVE GSVYDFVAEG IEEQAEYLKR YHDISRLVMN DPSEKNLNEL
AKVQEQLDHH NLWQLENRIN EVLAQLGLDP NVALSSLSGG WLRKAALGRA LVSNPRVLLL
DEPTNHLDIE TIDWLEGFLK TFNGTIIFIS HDRSFIRNMA TRIVDLDRGK LVTYPGNYDQ
YLLEKEEALR VEELQNAEFD RKLAQEEVWI RQGIKARRTR NEGRVRALKA MRRERGERRE
VMGTAKMQVE EASRSGKIVF EMEDVCYQVN GKQLVKDFSA QVLRGDKIAL IGPNGCGKTT
LLKLMLGQLQ ADSGRIHVGT KLEVAYFDQH RAELDPDKTV MDNLAEGKQE VMVNGKPRHV
LGYLQDFLFH PKRAMTPVRA LSGGERNRLL LARLFLKPSN LLILDEPTND LDVETLELLE
ELIDSYQGTV LLVSHDRQFV DNTVTECWIF EGGGKIGRYV GGYHDARGQQ EQYVALKQPA
VKKTEEAAAA KAETVKRSSS KLSYKLQREL EQLPQLLEDL EAKLEALQTQ VADASFFSQP
HEQTQKVLAD MAAAEQELEQ AFERWEYLEA LKNGG
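
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the DNA, and compute a GC fraction. It assumes the codon assignments of NCBI translation table 11 (identical to the standard code for elongation) and does not handle alternative start codons or ambiguous bases. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and are not part of the source database.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above.
first_line = "ATGTCATTAA TCAGTATGCA TGGCGCATGG CTGTCGTTCA GCGACGCGCC GCTTCTCGAT"
last_line = "GCCTTTGAAC GCTGGGAGTA TCTTGAAGCG TTAAAAAATG GTGGCTGA"
print(translate(first_line))  # MSLISMHGAWLSFSDAPLLD
print(translate(last_line))   # AFERWEYLEALKNGG
```

Both fragments happen to start on a codon boundary, so their translations line up with the start and end of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.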