Gene ECD_01976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01976 
SymbolyegI 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2048038 
End bp2049978 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content52% 
IMG OID 
Producthypothetical protein 
Protein accessionACT43827 
Protein GI253978157 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCCA CTTTATATAC TGCTACTGGT GAGTGCGTTA CGCCAGGCCG TGAACTGGGC 
AAAGGCGGTG AAGGTGCGGT TTATGATATC GAGGAGTTTG CCGATAGCGT CGCCAAGATT
TATCACACGC CGCCACCCGC CTTAAAACAG GACAAACTTG CCTTTATGGC TGCGACAGCT
GACGCGCAGT TGTTGAATTA TGTCGCCTGG CCGCAGGCAA CGCTTCACGG TGGGCGAGGC
GGAAAAGTTA TCGGTTTTAT GATGCCAAAA GTTTCTGGTA AAGAACCGAT TCATATGATC
TATAGCCCGG CACATCGTCG CCAGAGTTAC CCTCATTGTG CGTGGGATTT TCTACTCTAT
GTTGCGCGCA ATATTGCTTC ATCTTTTGCT ACGGTTCACG AGCACGGGCA CGTCGTGGGG
GACGTAAACC AGAACAGCTT TATGGTAGGT CGCGACAGCA AAGTGGTGTT GATCGATAGT
GACTCCTTTC AGATTAACGC CAATGGCACA CTGCATTTAT GCGAAGTCGG CGTGTCGCAT
TTTACGCCGC CAGAGCTACA AACCTTGCCA TCATTTGTCG GTTTTGAACG TACCGCGAAT
CACGATAATT TTGGCCTTGC GTTGCTGATT TTTCACGTCT TGTTTGGTGG GCGGCATCCT
TATTCTGGTG TGCCGCTTAT CTCTGATGCG GGTAATGCGC TGGAGACGGA TATTGCCCAT
TTCCGTTATG CCTACGCGTC AGATAATCAG CGACGTGGTT TAAAACCGCC GCCACGATCG
ATTCCGCTGT CGATGTTACC GGGCGATGTT GAAGCCATGT TTCAGCAGGC GTTTACGGAA
AGTGGTGTAG CAACCGGGCG TCCGACGGCT AAAGCGTGGG TAGCAGCACT GGATTCTCTA
CGCCAACAGT TAAAGAAATG TACCGTTTCG GCAATGCATG TTTATCCCGC TCATTTGACC
GACTGCCCGT GGTGTACGCT GGATAATCAA GGCGTTATCT ATTTTATTGA TCTCGGCGAA
GAGGTCATTA CCACCGGCGG TGATTTTGTG CTGGCGAAAG TCTGGGCGAT GGTGATGGCG
TCAGTAGCAC CGCCAGCAGT GCAATTGCCA TTACCCGATC ATTTCCAACC GACTGGCAGG
CCGCTTCCTT TAGGCCTGTT ACGGCGCGAA TACATCATTC TGCTTGAGAT CGCACTGTCA
GCGTTATCGC TGTTGCTTTG CGGCCTTCAG GCAGAACCGC GTTATATTAT TTTGGTTCCT
GTGCTGGCGG CTATCTGGAT TATTGGCAGT CTGACAAGCA AAGCTTATAA AGCAGAAATC
CAGCAACGCC GTGAGGCATT TAATCGCGCG AAAATGGACT ATGACCATTT AGTCAGCCAG
ATCCAACAGT TGGGCGGGCT GGAAGGTTTT ATCGCCAAAC GGACGATGCT CGAAAAAATG
AAGGACGAAA TGCTCGGGTT ACCGGAGGAA GAAAAACGTG CTCTGGCAGC ACTTCACGAC
ACCGCAAGGG AACGGCAGAA GCAGAAGTTT CTGGAGGGAT TTTTTATTGA TGTTGCCTCT
ATTCCCGGTG TTGGCCCTGC GCGTAAAGCG GCGTTACGGT CCTTTGGTAT TGAAACAGCA
GCGGATGTTA CCCGTCGTGG GGTTAAGCAA GTTAAAGGGT TTGGTGATCA TCTGACCCAG
GCGGTCATCG ACTGGAAAGC GAGCTGTGAA CGCCGTTTTG TGTTCAGGCC GAACGAAGCG
GTAACGCCTG CAGACAGACA AGCGGTAATG GCGAAAGTGG CCGCCAAACG ACATCGGCTG
GAATCGGCGT TGACTGTCGG CGCGACAGAG TTGCAGCGAT TCCGCCTTCA TGCTCCAGCA
CGGACCATGC CGTTGATGGA ACCGTTACGT CAGGCGGCAG AAAAACTGGC TCAGGCGCAG
GCAGATTTAA GCCGCTGCTG A
 
Protein sequence
MKPTLYTATG ECVTPGRELG KGGEGAVYDI EEFADSVAKI YHTPPPALKQ DKLAFMAATA 
DAQLLNYVAW PQATLHGGRG GKVIGFMMPK VSGKEPIHMI YSPAHRRQSY PHCAWDFLLY
VARNIASSFA TVHEHGHVVG DVNQNSFMVG RDSKVVLIDS DSFQINANGT LHLCEVGVSH
FTPPELQTLP SFVGFERTAN HDNFGLALLI FHVLFGGRHP YSGVPLISDA GNALETDIAH
FRYAYASDNQ RRGLKPPPRS IPLSMLPGDV EAMFQQAFTE SGVATGRPTA KAWVAALDSL
RQQLKKCTVS AMHVYPAHLT DCPWCTLDNQ GVIYFIDLGE EVITTGGDFV LAKVWAMVMA
SVAPPAVQLP LPDHFQPTGR PLPLGLLRRE YIILLEIALS ALSLLLCGLQ AEPRYIILVP
VLAAIWIIGS LTSKAYKAEI QQRREAFNRA KMDYDHLVSQ IQQLGGLEGF IAKRTMLEKM
KDEMLGLPEE EKRALAALHD TARERQKQKF LEGFFIDVAS IPGVGPARKA ALRSFGIETA
ADVTRRGVKQ VKGFGDHLTQ AVIDWKASCE RRFVFRPNEA VTPADRQAVM AKVAAKRHRL
ESALTVGATE LQRFRLHAPA RTMPLMEPLR QAAEKLAQAQ ADLSRC