Gene EcHS_A2080 details


Gene Information

Locus tag: EcHS_A2080
Symbol:
ID: 5594403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2065622
End bp: 2067616
Gene Length: 1995 bp
Protein Length: 664 aa
Translation table: 11
GC content: 53%
IMG OID: 640921221
Product: hypothetical protein
Protein accession: YP_001458765
Protein GI: 157161447
COG category:
COG ID:
TIGRFAM ID:
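
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the helper names `span_length` and `expected_protein_length` are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information record.
start_bp = 2065622
end_bp = 2067616
gene_length_bp = 1995
protein_length_aa = 664


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate range (assumed 1-based)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1995
print(expected_protein_length(gene_length_bp))  # 664
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.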


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00000313758
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAAAA GATATAATAC CGGCAATCCA AGACCTTCAA ATAGCATGAA GGATCTGAAT 
GATAACGCCC TGGCGTACGA TGATTTCCTG AACAGCGAAA GCGATACTTT TATAGATCGT
TTTGGTAACG CCCAGGATAC GATAATTGGG GCTACTAAAA AAATGGCAGC TGCTACCGAC
GCTGTTATTG ATGAAGCCCG CCAAAACCTG ATCCCTCTCA GCCGGCAGTA CATGACGCTG
GCGGCGGCGC AGGCGGATAT TGCGAATATT CCGGCAGGTT CAACAACCTA TGTCCGCAGT
CAGGACGGAA GCTCTCTGGC CGATGAGTAT ATCAACCTCG CTGGAACGCT GCAGCCAACC
GGACGGCGGA TGGTTCGTGA CGACTACGCA TACCAGGTAT CGCCAGACAG CGTGACCCTG
GCAGCATATG ATCCGGAGAC TTCCCGCGTG GCTCCATTTT TAAATACAAG CGGCAGATTA
ATTCAAATCG GTCCTGACGG AAAATATTAC GAACTTTTAA CCCAACAAGA ATCCGAACTC
TATGCGCTGG GCCGGGAGGG TTCTATACCG CAGTTTATTG GCGGTGAAAA AGTGTGGCGG
ATGACGGTTG ATTCAACCAC AAACCAGATC GTTGAAGCTT ATACGGTTGG TGGGAAGCAC
TGGATTTACT CAGACGGTGG CCTGGTAGCT GTTAATAACG GAAATGGCGG TGGTGGTGGC
GACGATGATG CCAACCAGCT CCCTGAGTAT GGACTTCATT TGTCAGGGTC TACTGTGTAC
CCCTACTCAG AGACAGTGCC TGTATGTTTT ATCTTTGTGA CTGCTGGGCA ATCCAACGCT
CGAGGATATT GTCCTGACGC CGATCAAACC ATTGTCGCAG CAACGCCGAT ATATCCTGAT
AACGCTTTCA TGCTCAGCGG CGGGGTTAGG CGTACAGGGA CACGCAGCAC TACTCTGGTG
CCACTGGTTG AGGCAGTAAG TGGGACAGAT AAAGAAACGG CCGCAAGCGG CCTCGCGAAC
ACCTTCATTC GCGATATGGC TGCAGCTACC GGAATCATGC CGCGCACGCT ATCAATCGTA
TGTGCGCAGT CTGGTCAGGC TTACGAGTAC CAGAAACGGG GTAACCAGGT ATATCAGTAT
CTGCTCGATT CAATCGAAGA CTGCGTAACG GCCTGTAAAG CACGCGGCTG GCTGCCGATT
GTTCTCTGCG TTGACTGGAT GCAGGGAGAG TCCGACGAGG ACTGGTCAGG ATTACGAGAA
GGAATGTATG AATCACGGAT GAGGCAGTAC CAGAGACAAA TCACCAGCGA CATCATCGCA
AGAACGGGTC AAAACGAACC GCCGATTATC GCCATTACCC AGCTGGGGTA TGTCAATGAC
GGGCATGGTG CATTTACAGG CCAGTACGCG CGACTGGCGT CGACGCGATT GCACGGAAAA
GAGCAATTCA GGCTGGTCAA TAGTTTGTAC CAGTACGATT TTATTTCAGA CGGTCTGCAC
TTGACGTGTG CGGGCCAGAA CCGGCGCGGA GCAGCTGTGG CGAGAGCGCT TCTCCAGGAG
TGGTTTACGA GCGGCTGGTC AGGGATGGTT CCGACCAGTT TCGTGTGGAA CTCACCCACG
CAGATACAAA TCAATGTCCC AGCGTATACG AACCTGGTGC TGGACACGAC TACGATCAAC
ACCTCCGGTC TGGCCAATTA CGGCTTTAGC TACACGGATG AGACTGGTGC TCCACCTGCT
ATATCGAGCA TCGCGATCAG CTCGGACGGC AAGGGCGTGC TGATTAACCT GGCGACCGCC
CCCTCTGGAC GTTTTGGGCG CGTTTCCTAT GCGACAGCAG AAAACCCACT TCAGAGCGGC
GCATCTGTAA AACCTTCCGG GCGGACTCTT GGTGCAAGAG GGTGTGTTCG ATCTTCCGCT
GGAATCATAT GGGTGTATGA CACATCCGTG ACTCTTTACG ACTGGCTCCC CGCTTTTCGT
ATTAACGTTT TCTGA
 
Protein sequence
MDKRYNTGNP RPSNSMKDLN DNALAYDDFL NSESDTFIDR FGNAQDTIIG ATKKMAAATD 
AVIDEARQNL IPLSRQYMTL AAAQADIANI PAGSTTYVRS QDGSSLADEY INLAGTLQPT
GRRMVRDDYA YQVSPDSVTL AAYDPETSRV APFLNTSGRL IQIGPDGKYY ELLTQQESEL
YALGREGSIP QFIGGEKVWR MTVDSTTNQI VEAYTVGGKH WIYSDGGLVA VNNGNGGGGG
DDDANQLPEY GLHLSGSTVY PYSETVPVCF IFVTAGQSNA RGYCPDADQT IVAATPIYPD
NAFMLSGGVR RTGTRSTTLV PLVEAVSGTD KETAASGLAN TFIRDMAAAT GIMPRTLSIV
CAQSGQAYEY QKRGNQVYQY LLDSIEDCVT ACKARGWLPI VLCVDWMQGE SDEDWSGLRE
GMYESRMRQY QRQITSDIIA RTGQNEPPII AITQLGYVND GHGAFTGQYA RLASTRLHGK
EQFRLVNSLY QYDFISDGLH LTCAGQNRRG AAVARALLQE WFTSGWSGMV PTSFVWNSPT
QIQINVPAYT NLVLDTTTIN TSGLANYGFS YTDETGAPPA ISSIAISSDG KGVLINLATA
PSGRFGRVSY ATAENPLQSG ASVKPSGRTL GARGCVRSSA GIIWVYDTSV TLYDWLPAFR
INVF
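
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method, of how the blocked gene sequence could be normalized, translated, and summarized by GC content. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The codon table is the standard NCBI amino-acid assignment, which translation table 11 shares for internal codons. Only the first line of the gene sequence is used here as an excerpt.

```python
from itertools import product

# Codon-to-amino-acid map in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record (excerpt only).
excerpt = clean(
    "ATGGACAAAA GATATAATAC CGGCAATCCA AGACCTTCAA ATAGCATGAA GGATCTGAAT"
)
print(translate(excerpt))  # MDKRYNTGNPRPSNSMKDLN
print(round(gc_percent(excerpt), 1))
```

The translated excerpt matches the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare with the GC content field of the record.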