Gene EcHS_A1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1722 
Symbol 
ID5594887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1744074 
End bp1746086 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content53% 
IMG OID640920870 
Productfusaric acid resistance domain-containing protein 
Protein accessionYP_001458426 
Protein GI157161108 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value0.375298 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCAT CGTCATGGTC CTTGCGCAAT TTGCCCTGGT TCAGGGCCAC GCTGGCGCAA 
TGGCGTTATG CGTTACGCAA TACCATTGCC ATGTGTCTGG CGCTGACGGT TGCCTATTAT
TTAAATCTGG ATGAACCCTA TTGGGCGATG ACCTCGGCTG CAGTGGTTAG CTTTCCCACC
GTTGGCGGTG TTATCAGCAA AAGCCTCGGA CGCATCGCTG GCAGTTTGCT CGGAGCCATT
GCGGCACTGC TTCTTGCCGG GCATACGCTC AATGAGCCGT GGTTTTTTCT ATTGAGCATG
TCGGCGTGGC TTGGCTTTTG TACCTGGGCC TGTGCGCACT TCACGAATAA CGTCGCGTAT
GCATTTCAAC TGGCGGGCTA CACGGCTGCC ATCATCGCCT TTCCGATGGT TAATATTACT
GAGGCCAGCC AGCTGTGGGA TATCGCTCAG GCGCGCGTTT GCGAGGTGAT TGTCGGCATT
TTGTGCGGCG GCATGATGAT GATGATCCTG CCTAGCAGTT CCGATGCTAC AGCCCTTTTA
ACCGCATTGA AAAACATGCA CGCCCGACTA CTTGAACATG CCAGTTTACT CTGGCAGCCT
GAAACAACCG ATGCCATTCG TGCAGCACAT GAAGGGGTGA TTGGGCAGAT ACTGACCATG
AATTTGCTGC GTATCCAGGC TTTCTGGAGC CACTATCGTT TTCGCCAGCA AAACGCGCGC
CTTAATGCGC TGCTCCACCA GCAATTACGT ATGACCAGTG TCATCTCCAG CCTGCGACGT
ATGTTGCTCA ACTGGCCCTC ACCGCCAGGT GCCACACGAG AAATTCTCGA ACAGTTGCTG
ACGGCGCTCG CCAGTTCGCA AACAGATGTT TACACCGTCG CACGTATTAT CGCCCCGCTA
CGCCCGACCA ACGTCGCCGA CTATCGGCAC GTCGCCTTCT GGCAGCGACT ACGTTATTTT
TGCCGCCTTT ATCTGCAAAG TAGTCAGGAA TTACATCGTC TGCAAAGCGG TGTAGATGAT
CATACCAGAC TCCCACGGAC ATCCGGCCTG GCTCGTCATA CCGATAACGC CGAAGCTATG
TGGAGCGGGC TGCGTACATT TTGTACGTTG ATGATGATTG GCGCATGGAG TATTGCTTCG
CAATGGGATG CCGGTGCCAA TGCATTAACG CTGGCAGCAA TTAGCTGCGT ACTCTACTCC
GCCGTCGCAG CACCGTTTAA GTCGTTGTCA CTTCTGATGC GCACGCTGGT GTTACTTTCG
CTATTCAGCT TTGTGGTCAA ATTTGGTCTG ATGGTCCAGA TTAGCGATCT GTGGCAATTT
TTACTGTTTC TCTTTCCACT GCTGGCGACA ATGCAGCTTC TTAAATTGCA GATGCCAAAA
TTTGCCGCAT TGTGGGGGCA ACTGATTGTT TTTATGGGTT CTTTTATCGC TGTCACTAAT
CCCCCGGTGT ATGATTTTGC TGATTTTCTT AACGATAATC TGGCAAAAAT CGTTGGCGTC
GCGTTGGCGT GGTTAGCGTT CGCCATTCTG CGTCCAGGAT CGGATGCTCG TAAAAGCCGC
CGCCATATTC GCGCGCTGCG CCGGGATTTT GTCGATCAGC TAAGCCGCCA TCCAACACTG
AGTGAAAGCG AATTTGAATC GCTCACTTAT CATCACGTCA GTCAGTTGAG TAACAGCCAG
GATGCGCTGG CTCGCCGTTG GTTATTACGC TGGGGTGTAG TGCTGCTGAA CTGTTCTCAT
GTTGTCTGGC AATTGCGCGA CTGGGAATCG CGTTCCGATC CGTTATCGCG AGTACGGGAT
AACTGTATTT CACTGTTGCG GGGAGTGATG AGTGAGCGTG GCGTTCAGCA AAAATCACTG
GCGGCCACAC TTGAAGAATT ACAGCGGATT TGCGACAGCC TTGCCCGTCA TCATCAACCT
GCCGCCCGTG AGCTGGCGGC AATTGTCTGG CGGCTGTACT GCTCGCTTTC GCAACTTGAG
CAAGCACCAC CGCAAGGTAC GCTGGCCTCT TAA
 
Protein sequence
MNASSWSLRN LPWFRATLAQ WRYALRNTIA MCLALTVAYY LNLDEPYWAM TSAAVVSFPT 
VGGVISKSLG RIAGSLLGAI AALLLAGHTL NEPWFFLLSM SAWLGFCTWA CAHFTNNVAY
AFQLAGYTAA IIAFPMVNIT EASQLWDIAQ ARVCEVIVGI LCGGMMMMIL PSSSDATALL
TALKNMHARL LEHASLLWQP ETTDAIRAAH EGVIGQILTM NLLRIQAFWS HYRFRQQNAR
LNALLHQQLR MTSVISSLRR MLLNWPSPPG ATREILEQLL TALASSQTDV YTVARIIAPL
RPTNVADYRH VAFWQRLRYF CRLYLQSSQE LHRLQSGVDD HTRLPRTSGL ARHTDNAEAM
WSGLRTFCTL MMIGAWSIAS QWDAGANALT LAAISCVLYS AVAAPFKSLS LLMRTLVLLS
LFSFVVKFGL MVQISDLWQF LLFLFPLLAT MQLLKLQMPK FAALWGQLIV FMGSFIAVTN
PPVYDFADFL NDNLAKIVGV ALAWLAFAIL RPGSDARKSR RHIRALRRDF VDQLSRHPTL
SESEFESLTY HHVSQLSNSQ DALARRWLLR WGVVLLNCSH VVWQLRDWES RSDPLSRVRD
NCISLLRGVM SERGVQQKSL AATLEELQRI CDSLARHHQP AARELAAIVW RLYCSLSQLE
QAPPQGTLAS