Gene EcSMS35_2170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2170 
Symboluup 
ID6144963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2175723 
End bp2177630 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content53% 
IMG OID641617046 
ProductABC transporter ATPase component 
Protein accessionYP_001744220 
Protein GI170681436 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000710719 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.220785 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTAA TCAGTATGCA TGGCGCATGG CTGTCGTTCA GCGACGCGCC GCTTCTCGAT 
AACGCAGAAC TGCATATCGA AGATAACGAA CGTGTTTGTC TGGTAGGTCG TAACGGCGCA
GGCAAATCGA CGTTAATGAA GATCCTCAAC CGTGAACAGG GGCTGGATGA CGGTCGCATT
ATTTATGAGC AAGATTTGAT TGTAGCTCGT CTGCAACAGG ATCCGCCGCG TAACGTTGAG
GGTAGCGTTT ATGATTTCGT TGCCGAAGGC ATTGAAGAAC AAGCGGAATA TCTGAAACGC
TATCACGATC TTTCGCGCCT GGTGATGAAC GACCCGAGCG AGAAAAATCT CAACGAACTG
GCGAAGGTTC AGGAACAGCT GGATCACCAC AACCTGTGGC AGCTGGAAAA CCGCATCAAC
GAAGTGCTGG CGCAACTGGG ATTAGATCCT AATGTTGCGC TGTCGTCGCT TTCCGGCGGC
TGGTTGCGTA AAGCGGCATT AGGACGCGCG CTGGTGAGCA ATCCGCGTGT ATTGCTGCTC
GACGAACCGA CTAACCACCT GGATATTGAA ACCATCGACT GGCTGGAAGG GTTCCTCAAA
ACCTTCAATG GGACAATTAT TTTCATTTCC CACGACCGTT CGTTTATCCG CAATATGGCG
ACGCGCATTG TTGATCTCGA TCGCGGCAAG CTGGTGACCT ATCCAGGGAA TTACGACCAG
TACCTGCTGG AAAAAGAAGA AGCCCTGCGC GTGGAAGAAT TACAAAATGC CGAGTTCGAT
CGCAAACTGG CGCAGGAAGA GGTGTGGATC CGCCAGGGGA TCAAAGCGCG CCGTACCCGT
AATGAAGGCC GCGTACGCGC CCTGAAAGCA ATGCGCCGTG AACGTAGCCA GCGCCGTGAG
GTCATGGGGA CGGCAAAGAT GCAGGTGGAA GAAGCCAGCC GCTCCGGAAA GATCGTTTTC
GAAATGGAAG ACGTTTGCTA CCAGGTTGAC GGTAAGCAAC TGGTGAAAGA TTTTTCTGCC
CAGGTTCTAC GTGGCGACAA AATTGCCCTG ATTGGCCCGA ATGGGTGCGG CAAAACCACG
CTGCTGAAAC TGATGCTCGG TCAGCTTCAG GCGGACAGCG GGCGTATTCA CGTTGGCACC
AAACTGGAAG TGGCTTATTT CGATCAGCAC CGTGCGGAAC TGGATCCCGA TAAAACGGTG
ATGGATAACC TTGCCGAAGG TAAGCAAGAG GTGATGGTTA ACGGCAAGCC ACGCCACGTA
TTGGGCTATT TGCAGGACTT CCTGTTCCAT CCGAAACGGG CGATGACGCC GGTACGTGCG
CTTTCTGGCG GTGAGCGGAA CCGCTTGCTG CTGGCGCGTT TGTTCCTCAA ACCAAGCAAC
TTATTGATTC TTGACGAACC GACCAACGAT CTTGATGTCG AAACGCTGGA ACTGCTGGAA
GAACTGATCG ACAGCTACCA GGGCACGGTA TTGCTGGTAA GCCACGATCG TCAGTTTGTC
GATAACACCG TTACAGAATG CTGGATCTTC GAAGGCGGCG GTAAAATTGG TCGTTATGTC
GGCGGTTATC ATGATGCCCG TGGTCAGCAA GAGCAGTATG TGGCGCTCAA ACAGCCTGCG
GTGAAAAAAA CCGAAGAAGC CGCCGCGGCA AAAGCAGAAA CTGTAAAACG CAGCAGTAGC
AAACTAAGCT ATAAATTGCA GCGCGAACTG GAGCAGCTAC CGCAATTGCT TGAAGATCTG
GAGGCGAAGC TGGAAGCCCT ACAGACGCAG GTGGCGGATG CTTCCTTCTT CAGTCAGCCC
CATGAGCAGA CACAAAAAGT GCTCGCTGAT ATGGCTGCTG CAGAGCAGGA GCTGGAGCAA
GCCTTTGAAC GCTGGGAGTA TCTTGAAGCG TTAAAAAATG GTGGCTGA
 
Protein sequence
MSLISMHGAW LSFSDAPLLD NAELHIEDNE RVCLVGRNGA GKSTLMKILN REQGLDDGRI 
IYEQDLIVAR LQQDPPRNVE GSVYDFVAEG IEEQAEYLKR YHDLSRLVMN DPSEKNLNEL
AKVQEQLDHH NLWQLENRIN EVLAQLGLDP NVALSSLSGG WLRKAALGRA LVSNPRVLLL
DEPTNHLDIE TIDWLEGFLK TFNGTIIFIS HDRSFIRNMA TRIVDLDRGK LVTYPGNYDQ
YLLEKEEALR VEELQNAEFD RKLAQEEVWI RQGIKARRTR NEGRVRALKA MRRERSQRRE
VMGTAKMQVE EASRSGKIVF EMEDVCYQVD GKQLVKDFSA QVLRGDKIAL IGPNGCGKTT
LLKLMLGQLQ ADSGRIHVGT KLEVAYFDQH RAELDPDKTV MDNLAEGKQE VMVNGKPRHV
LGYLQDFLFH PKRAMTPVRA LSGGERNRLL LARLFLKPSN LLILDEPTND LDVETLELLE
ELIDSYQGTV LLVSHDRQFV DNTVTECWIF EGGGKIGRYV GGYHDARGQQ EQYVALKQPA
VKKTEEAAAA KAETVKRSSS KLSYKLQREL EQLPQLLEDL EAKLEALQTQ VADASFFSQP
HEQTQKVLAD MAAAEQELEQ AFERWEYLEA LKNGG