Gene ECD_03372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03372 
SymbolyhjG 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3537478 
End bp3539553 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content53% 
IMG OID 
Productpredicted outer membrane biogenesis protein 
Protein accessionACT45173 
Protein GI253979503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAGC GAACGATGAG CAAGGCAGGC AAAATAACCG CTGCGATTTC AGGGGCTTTC 
TTGTTGTTGA TTGTCGTGGC GATCATTTTG ATTGCAACTT TTGACTGGAA TCGACTCAAA
CCGACTATCA ACCAGAAAGT CTCTGCGGAG TTGAATCGTC CGTTCGCTAT CCGTGGCGAT
CTGGGCGTGG TGTGGGAGCG GCAAAAACAA GAAACTGGCT GGCGCAGCTG GGTGCCGTGG
CCCCATGTAC ACGCGGAAGA CATCATTCTT GGCAATCCAC CGGATATTCC CGAAGTCACG
ATGGTGCATT TGCCACGCGT AGAGGCAACG CTGGCCCCGC TGGCGCTGCT GACCAAAACG
GTCTGGCTGC CGTGGATCAA GCTCGAAAAG CCCGACGCGC GCCTGATTCG CCTCTCTGAA
AAGAACAATA ACTGGACGTT TAATCTTGCC AACGATGATA ACAAAGACGC GAATGCAAAG
CCGTCGGCAT GGTCGTTTCG GCTGGATAAT ATTCTTTTCG ATCAAGGGCG GATCGCCATT
GATGACAAAG TAAGCAAAGC GGATCTGGAG ATTTTTGTTG ATCCCTTAGG CAAGCCGCTG
CCGTTCAGCG AAGTTACTGG ATCGAAAGGT AAAGCGGATA AAGAAAAGGT GGGCGATTAC
GTTTTTGGCC TGAAGGCGCA GGGACGATAT AACGGTGAAC CGCTCACGGG TACGGGAAAA
ATAGGCGGTA TGCTGGCGCT GCGTGGCGAA GGGACGCCGT TTCCGGTACA GGCTGATTTC
CGCTCTGGTA ACACCCGTGT TGCTTTTGAT GGCGTCGTGA ATGACCCAAT GAAGATGGGC
GGTGTCGATT TACGGCTTAA ATTTTCTGGC GATTCACTGG GTGATCTCTA TGAACTGACG
GGCGTTCTGC TGCCCGATAC CCCGCCGTTT GAAACGGATG GTCGGCTGGT AGCGAAAATC
GACACTGAAA AATCGTCGGT CTTTGATTAT CGCGGTTTTA ATGGGCGAAT TGGTGATAGC
GATATCCACG GTTCTCTGGT CTACACCACC GGAAAGCCAC GACCAAAACT GGAAGGTGAT
GTCGAGTCGC GGCAATTGCG GCTGGCGGAC CTGGGACCGT TGATTGGCGT TGATTCCGGG
AAAGGGGCAG AAAAGTCGAA ACGGTCTGAA CAGAAGAAGG GCGAAAAAAG CGTTCAGCCT
GCGGGCAAAG TGCTGCCTTA TGACCGCTTC GAAACCGATA AATGGGACGT TATGGATGCC
GATGTTCGCT TCAAAGGGCG GCGCATTGAG CATGGCAGTA GCCTGCCGAT TAGCGATCTT
TCTACTCATA TCATCCTCAA AAATGCTGAC CTGCGCCTGC AACCGCTGAA ATTTGGCATG
GCGGGCGGCA GCATTGCGGC GAATATTCAT CTGGAAGGCG ATAAAAAGCC GATGCAGGGG
CGGGCAGATA TTCAGGCTCG TCGACTGAAA CTGAAAGAAC TGATGCCCGA TGTGGAACTG
ATGCAGAAGA CGCTGGGGGA AATGAACGGT GACGCGGAAC TACGCGGTAG CGGTAACTCG
GTGGCGGCAC TTTTAGGCAA CAGTAACGGC AACCTGAAAC TGTTGATGAA TGACGGGCTG
GTGAGCCGCA ACCTGATGGA GATTGTTGGG CTGAATGTCG GCAACTACAT TGTCGGTGCG
ATATTTGGTG ATGATGAGGT GCGGGTGAAC TGCGCGGCGG CGAATCTGAA TATTGCCAAC
GGCGTGGCGC GCCCGCAGAT TTTTGCTTTC GATACTGAGA ACGCGTTGAT TAATGTTACC
GGCACGGCAA GTTTTGCTTC GGAACAGCTG GATTTGACTA TTGATCCGGA GAGTAAAGGA
ATTCGGATTA TCACACTGCG TTCGCCGCTG TATGTGCGGG GGACGTTTAA AAATCCGCAG
GCTGGGGTGA AAGCCGGACC GCTGATTGCC CGTGGTGCTG TTGCTGCGGC ACTGGCAACG
CTGGTAACAC CGGCGGCGGC GTTACTGGCA CTGATCTCAC CTTCCGAAGG GGAGGCTAAT
CAGTGTCGGA CGATTTTGTC GCAGATGAAG AAGTGA
 
Protein sequence
MRKRTMSKAG KITAAISGAF LLLIVVAIIL IATFDWNRLK PTINQKVSAE LNRPFAIRGD 
LGVVWERQKQ ETGWRSWVPW PHVHAEDIIL GNPPDIPEVT MVHLPRVEAT LAPLALLTKT
VWLPWIKLEK PDARLIRLSE KNNNWTFNLA NDDNKDANAK PSAWSFRLDN ILFDQGRIAI
DDKVSKADLE IFVDPLGKPL PFSEVTGSKG KADKEKVGDY VFGLKAQGRY NGEPLTGTGK
IGGMLALRGE GTPFPVQADF RSGNTRVAFD GVVNDPMKMG GVDLRLKFSG DSLGDLYELT
GVLLPDTPPF ETDGRLVAKI DTEKSSVFDY RGFNGRIGDS DIHGSLVYTT GKPRPKLEGD
VESRQLRLAD LGPLIGVDSG KGAEKSKRSE QKKGEKSVQP AGKVLPYDRF ETDKWDVMDA
DVRFKGRRIE HGSSLPISDL STHIILKNAD LRLQPLKFGM AGGSIAANIH LEGDKKPMQG
RADIQARRLK LKELMPDVEL MQKTLGEMNG DAELRGSGNS VAALLGNSNG NLKLLMNDGL
VSRNLMEIVG LNVGNYIVGA IFGDDEVRVN CAAANLNIAN GVARPQIFAF DTENALINVT
GTASFASEQL DLTIDPESKG IRIITLRSPL YVRGTFKNPQ AGVKAGPLIA RGAVAAALAT
LVTPAAALLA LISPSEGEAN QCRTILSQMK K