Gene ECD_10039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_10039 
Symbol
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp772892 
End bp774211 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content61% 
IMG OID 
Productcapsid component 
Protein accessionACT42621 
Protein GI253976951 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACAGCAG AGCTGCGTAA TCTCCCGCAT ATTGCCAGCA TGGCCTTTAA TGAGCCGCTG 
ATGCTTGAAC CCGCCTATGC GCGGGTTTTC TTTTGTGCGC TTGCAGGCCA GCTTGGGATC
AGCAGCCTGA CGGATGCGGT GTCCGGCGAC AGCCTGACTG CCCAGGAGGC ACTCGCGACG
CTGGCATTAT CCGGTGATGA TGACGGACCA CGACAGGCCC GCAGTTATCA GGTCATGAAC
GGCATCGCCG TGCTGCCGGT GTCCGGCACG CTGGTCAGCC GGACGCGGGC GCTGCAGCCG
TACTCGGGGA TGACCGGTTA CAACGGCATT ATCGCCCGTC TGCAACAGGC TGCCAGCGAT
CCGATGGTGG ACGGCATTCT GCTCGATATG GACACGCCCG GCGGGATGGT GGCGGGGGCA
TTTGACTGCG CTGACATCAT CGCCCGTGTG CGTGACATAA AACCGGTATG GGCGCTTGCC
AACGACATGA ACTGCAGTGC AGGTCAGTTG CTTGCCAGTG CCGCCTCCCG GCGTCTGGTC
ACGCAGACCG CCCGGACAGG CTCCATCGGC GTCATGATGG CTCACAGTAA TTACGGTGCT
GCGCTGGAGA AACAGGGTGT GGAAATCACG CTGATTTACA GCGGCAGCCA TAAGGTGGAT
GGCAACCCCT ACAGCCATCT TCCGGATGAC GTCCGGGAGA CACTGCAGTC CCGGATGGAC
GCAACCCGCC AGATGTTTGC GCAGAAGGTG TCGGCATATA CCGGCCTGTC CGTGCAGGTT
GTGCTGGATA CCGAGGCTGC AGTGTACAGC GGTCAGGAGG CCATTGATGC CGGACTGGCT
GATGAACTTG TTAACAGCAC CGATGCGATC ACCGTCATGC GTGATGCACT GGATGCACGT
AAATCCCGTC TCTCAGGAGG GCGAATGACC AAAGAGACTC AATCAACAAC TGTTTCAGCC
ACTGCTTCGC AGGCTGACGT TACTGACGTG GTGCCAGCGA CGGAGGGCGA GAACGCCAGC
GCGGCGCAGC CGGACGTGAA CGCGCAGATC ACCGCAGCGG TTGCGGCAGA AAACAGCCGC
ATTATGGGGA TACTCAACTG TGAGGAGGCT CACGGACGCG AAGAACAGGC ACGCGTGCTG
GCAGAAACCC CCGGTATGAC CGTGAAAACG GCCCGCCGCA TTCTGGCCGC AGCACCACAG
AGTGCACAGG CGCGCAGTGA CACTGCGCTG GATCGTCTGA TGCAGGGGGC ACCGGCACCG
CTGGCTGCAG GTAACCCGGC ATCTGATGCC GTTAACGATT TGCTGAACAC ACCAGTGTAA
 
Protein sequence
MTAELRNLPH IASMAFNEPL MLEPAYARVF FCALAGQLGI SSLTDAVSGD SLTAQEALAT 
LALSGDDDGP RQARSYQVMN GIAVLPVSGT LVSRTRALQP YSGMTGYNGI IARLQQAASD
PMVDGILLDM DTPGGMVAGA FDCADIIARV RDIKPVWALA NDMNCSAGQL LASAASRRLV
TQTARTGSIG VMMAHSNYGA ALEKQGVEIT LIYSGSHKVD GNPYSHLPDD VRETLQSRMD
ATRQMFAQKV SAYTGLSVQV VLDTEAAVYS GQEAIDAGLA DELVNSTDAI TVMRDALDAR
KSRLSGGRMT KETQSTTVSA TASQADVTDV VPATEGENAS AAQPDVNAQI TAAVAAENSR
IMGILNCEEA HGREEQARVL AETPGMTVKT ARRILAAAPQ SAQARSDTAL DRLMQGAPAP
LAAGNPASDA VNDLLNTPV