Gene ECD_03438 details


Gene Information

Locus tag: ECD_03438
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 3615417
End bp: 3617396
Gene Length: 1980 bp
Protein Length: 659 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: conserved hypothetical protein
Protein accession: ACT45237
Protein GI: 253979567
COG category:
COG ID:
TIGRFAM ID:
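
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are illustrative and not part of the source record.

```python
def inclusive_span(start_bp: int, end_bp: int) -> int:
    """Number of bases covered by 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is an untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3615417, 3617396
gene_length_bp, protein_length_aa = 1980, 659

print(inclusive_span(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)    # True
```

Under these assumptions the listed span, gene length, and protein length agree with one another.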


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACATTT CGGAAGTCGA TCTGCGTAAA CTGACGGTCA GCGATCCGTT CCTCGGTCAG 
TACCAACAAC TGGTCCGCGA CGTGGTGATT TCTTATCAAT GGGATGCCTT GAACGATCGT
ATCCCAGAAG CGGAACCCAG CCATGCGATT GAAAACTTTC GCATTGCTGC CGGACTTCAG
GAGGGTGAAT TTTACGGGAT GGTGTTTCAG GACAGCGACG TCGCCAAATG GCTGGAAGCG
GTAGCCTGGT CGCTGTGCCA GAAGCCGGAC GCCGAACTGG AAAAAACCGC CGACGAGGTA
ATCGAACTGA TCGCCTCCGC CCAATGTGAA GACGGCTATC TCAATACTTA CTTTACGGTA
AAAGCACCCG AAGAACGCTG GAGCAATCTT GCGGAGTGTC ATGAACTTTA CTGCGCCGGT
CATCTGATTG AAGCCGGAGT CGCCTTCTTC CAGGCCACGG GAAAACGACG CTTGCTGGAG
GTGGTTTGCC GTCTGGCCGA TCATATCGAC CGCGTATTTG GTCCAGATGA AAGTAAGTTA
CACGGTTATC CTGGTCACCC GGAAATTGAA CTGGCACTAA TGCGCCTGTA TGAAGTGACT
GAAGAGCCGC GCTACCTGGC GCTGACGAAC TATTTTGTCG AACAGCGTGG TGCGCAACCG
CACTATTACG ACCAAGAATA TGAAAAGCGC GGGCAGACAT CGCACTGGCA CACCTACGGC
CCGGCGTGGA TGGTGAAAGA CAAAGCCTAC AGCCAGGCAC ATTTGTCCCT TGCGCAACAG
CAAACCGCCA TCGGTCACGC GGTACGTTTT GTCTACCTGA TGACCGGCGT CGCGCATCTC
GCGCGTTTAA GTCACGATGA CAGCAAGCGT CAGGACTGCC TGAGGCTGTG GAACAATATG
GCCCAGCGTC AGTTATATAT TACCGGCGGC ATTGGCTCGC AAAGCAGCGG CGAAGCGTTC
ACTAGCGATT ACGATCTGCC GAATGACACG GTTTACGCCG AAAGTTGTGC TTCCATCGGC
CTGATGATGT TCGCCCGGCG AATGCTGGAA ATGGAAGGCG ACAGTCAATA TGCCGATGTG
ATGGAGCGCG CGCTGTACAA CACCGTGCTC GGCGGCATGG CGCTGGATGG CAAACATTTC
TTCTATGTGA ATCCGCTGGA AGTACATCCA AAATCGCTGA AATTCAACCA TATCTACGAT
CACGTTAAAC CGATCCGCCA GCGTTGGTTT GGCTGCGCTT GTTGTCCGCC AAATATCGCC
CGCGTGCTGA CCTCGATTGG TCATTATCTC TACACGCCGC GTGAAGATGC GTTGTATATC
AACATATACG CAGGAAACAG CATGGAAGTG CCGGTAGAAA ATGGCACGCT GCGCCTGCGG
GTTAGCGGGA ACTATCCGTG GCAGGAGCAG GTGACGATTG CGGTTGAATC GCCCCAGCCG
GTACGTCATA CGCTGGCTTT ACGTCTGCCG GACTGGTGCA CACAGCCGCA GATCATATTG
AATGGGGAAG AGGTCGAGCA GGATATTCGT AAAGGGTATT TGCACATTAC CCGCGAATGG
CAGGAGGGCG ATACGCTGAA TCTGACTTTG CCGATGCCGG TACGCCGCGT TTACGGTAAC
CCGCTGGTGC GTCACGTCGC CGGAAAAGTG GCGATTCAGC GCGGCCCGCT GGTGTATTGC
CTGGAACAGG CCGACAACGG CGAGTCACTG CATAATCTGT GGCTGCCCAC CGATGCGCCA
TTTACGACAT TTGAAGGCAA GGGATTGTTT AGCCATAAGA TCTTAATCCA GGCACCGGGT
TACCGGTATG AACAGAGCAA TCCAGAGCAG CAACCGCTGT GGCATTACGA CAGCGCGCCA
GCCAAACGCC AGCCGCAAAC TCTGACGTTT ATCCCGTGGT TTAGCTGGGC TAACCGGGGC
GAAGGCGAAA TGCGGATCTG GGTGAATGAG GAAAAGCATC GCCATCCGGA GGTTGGATAA
 
Protein sequence
MNISEVDLRK LTVSDPFLGQ YQQLVRDVVI SYQWDALNDR IPEAEPSHAI ENFRIAAGLQ 
EGEFYGMVFQ DSDVAKWLEA VAWSLCQKPD AELEKTADEV IELIASAQCE DGYLNTYFTV
KAPEERWSNL AECHELYCAG HLIEAGVAFF QATGKRRLLE VVCRLADHID RVFGPDESKL
HGYPGHPEIE LALMRLYEVT EEPRYLALTN YFVEQRGAQP HYYDQEYEKR GQTSHWHTYG
PAWMVKDKAY SQAHLSLAQQ QTAIGHAVRF VYLMTGVAHL ARLSHDDSKR QDCLRLWNNM
AQRQLYITGG IGSQSSGEAF TSDYDLPNDT VYAESCASIG LMMFARRMLE MEGDSQYADV
MERALYNTVL GGMALDGKHF FYVNPLEVHP KSLKFNHIYD HVKPIRQRWF GCACCPPNIA
RVLTSIGHYL YTPREDALYI NIYAGNSMEV PVENGTLRLR VSGNYPWQEQ VTIAVESPQP
VRHTLALRLP DWCTQPQIIL NGEEVEQDIR KGYLHITREW QEGDTLNLTL PMPVRRVYGN
PLVRHVAGKV AIQRGPLVYC LEQADNGESL HNLWLPTDAP FTTFEGKGLF SHKILIQAPG
YRYEQSNPEQ QPLWHYDSAP AKRQPQTLTF IPWFSWANRG EGEMRIWVNE EKHRHPEVG
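
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and the input is assumed to contain only A, C, G, T and whitespace.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAACATTT CGGAAGTCGA TCTGCGTAAA"))  # MNISEVDLRK
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would be the natural way to compare against the complete 659-residue listing.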