Gene EcHS_A3549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3549 
Symbol 
ID5594020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3525632 
End bp3527545 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content56% 
IMG OID640922666 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_001460147 
Protein GI157162829 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.309193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGTTT TCTCCTCGTT ACAAATTCGT CGCGGCGTGC GCGTCCTGCT GGATAATGCC 
ACCGCCACCA TCAACCCTGG GCAGAAAGTC GGCCTGGTGG GTAAAAACGG CTGTGGTAAA
TCTACCCTGC TGGCATTGCT GAAAAATGAA ATCAGCGCCG ACGGCGGCAG CTACACCTTT
CCGGGAAGCT GGCAACTGGC GTGGGTGAAT CAGGAAACGC CGGCGTTACC GCAAGCGGCG
CTGGAATATG TCATTGACGG CGACCGTGAA TATCGTCAAC TAGAAGCGCA GCTACACGAC
GCCAACGAAC GTAACGACGG GCACGCCATT GCGACCATTC ATGGCAAGCT GGATGCTATT
GACGCATGGA GTATTCGCTC CCGTGCTGCC AGCCTGCTGC ACGGCCTCGG TTTCAGCAAT
GAACAACTGG AGCGCCCGGT AAGTGATTTC TCCGGTGGCT GGCGTATGCG TCTTAACCTT
GCCCAGGCGC TGATTTGCCG TTCAGACTTG CTGCTGCTCG ACGAACCGAC TAACCACCTC
GATCTCGATG CCGTTATCTG GCTGGAAAAA TGGTTGAAGA GCTATCAGGG CACGCTGATC
CTGATCTCTC ACGACCGCGA CTTCCTCGAT CCGATCGTTG ATAAAATTAT TCATATCGAA
CAACAAAGCA TGTTCGAGTA CACCGGCAAC TACAGTTCGT TTGAAGTACA GCGCGCCACC
CGTCTGGCGC AGCAACAAGC GATGTATGAA AGCCAGCAGG AACGCGTAGC GCATCTGCAA
AGTTATATCG ACCGTTTCCG TGCCAAAGCC ACCAAAGCGA AGCAGGCCCA GAGCCGCATT
AAGATGCTCG AGCGTATGGA GCTGATTGCC CCCGCGCACG TCGACAACCC GTTCCGCTTT
AGCTTCCGCG CGCCGGAAAG CCTGCCAAAT CCGTTACTGA AGATGGAAAA AGTCAGCGCG
GGCTATGGCG ATCGCATTAT TCTCGACTCG ATTAAACTGA ACCTGGTGCC CGGCTCGCGC
ATTGGTCTGT TAGGCCGCAA CGGCGCGGGT AAATCGACAT TAATCAAACT GTTAGCCGGT
GAACTTGCGC CAGTCAGCGG TGAAATTGGT CTGGCGAAAG GGATCAAGCT CGGCTACTTC
GCCCAGCATC AACTTGAATA CCTGCGCGCC GACGAATCAC CTATTCAACA TCTGGCACGT
TTAGCGCCGC AGGAGCTGGA ACAAAAACTG CGTGACTACC TCGGCGGCTT TGGTTTCCAG
GGCGATAAAG TAACCGAAGA AACGCGCCGC TTCTCAGGTG GGGAAAAAGC CCGCCTGGTG
CTGGCATTAA TCGTCTGGCA GCGTCCGAAT CTGCTGCTGC TCGACGAACC GACTAACCAC
CTTGACCTCG ACATGCGTCA GGCACTCACC GAAGCATTAA TCGAGTTTGA AGGCGCGCTG
GTTGTCGTTT CGCACGACCG TCATTTGCTG CGTTCCACCA CTGACGATCT CTACCTGGTT
CACGATCGTA AAGTCGAACC GTTCGACGGC GATCTGGAAG ATTATCAACA GTGGTTGAGC
GACGTACAAA AGCAGGAAAA CCAGGCCGAC GAAGCGCCAA AAGAGAACGC GAACAGCGCC
CAGGCACGTA AAGATCAGAA GCGCCGGGAA GCGGAGCTGC GTGCGCAAAC CCAGCCACTG
CGTAAAGAGA TTGCCCGTCT GGAAAAAGAG ATGGAGAAGC TGAACGCGCA ACTGGCGCAG
GCGGAAGAGA AACTCGGCGA CAGCGAACTG TATGACCAGA GCCGTAAAGC GGAGTTGACC
GCCTGCCTGC AACAGCAAGC CAGCGCCAAA TCCGGCCTGG AAGAGTGCGA AATGGCGTGG
CTGGAAGCCC AGGAGCAGCT TGAGCAGATG TTGCTGGAAG GCCAAAGCAA CTGA
 
Protein sequence
MIVFSSLQIR RGVRVLLDNA TATINPGQKV GLVGKNGCGK STLLALLKNE ISADGGSYTF 
PGSWQLAWVN QETPALPQAA LEYVIDGDRE YRQLEAQLHD ANERNDGHAI ATIHGKLDAI
DAWSIRSRAA SLLHGLGFSN EQLERPVSDF SGGWRMRLNL AQALICRSDL LLLDEPTNHL
DLDAVIWLEK WLKSYQGTLI LISHDRDFLD PIVDKIIHIE QQSMFEYTGN YSSFEVQRAT
RLAQQQAMYE SQQERVAHLQ SYIDRFRAKA TKAKQAQSRI KMLERMELIA PAHVDNPFRF
SFRAPESLPN PLLKMEKVSA GYGDRIILDS IKLNLVPGSR IGLLGRNGAG KSTLIKLLAG
ELAPVSGEIG LAKGIKLGYF AQHQLEYLRA DESPIQHLAR LAPQELEQKL RDYLGGFGFQ
GDKVTEETRR FSGGEKARLV LALIVWQRPN LLLLDEPTNH LDLDMRQALT EALIEFEGAL
VVVSHDRHLL RSTTDDLYLV HDRKVEPFDG DLEDYQQWLS DVQKQENQAD EAPKENANSA
QARKDQKRRE AELRAQTQPL RKEIARLEKE MEKLNAQLAQ AEEKLGDSEL YDQSRKAELT
ACLQQQASAK SGLEECEMAW LEAQEQLEQM LLEGQSN