Gene ECD_10038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_10038 
Symbol
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp771310 
End bp772911 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content58% 
IMG OID 
Productcapsid component 
Protein accessionACT42620 
Protein GI253976950 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACGC CCACCATTCC CACCCTTCTG GGGCCGGACG GCATGACATC GCTGCGCGAA 
TATGCCGGTT ATCACGGCGG TGGCAGCGGA TTTGGAGGGC AGTTGCGGTC GTGGAACCCA
CCGAGTGAAA GTGTGGATGC AGCCCTGTTG CCCAACTTTA CCCGTGGCAA TGCCCGCGCA
GACGATCTGG TACGCAATAA CGGCTATGCC GCCAACGCCA TCCAGCTGCA TCAGGATCAT
ATCGTCGGGT CTTTTTTCCG GCTCAGTCAT CGCCCAAGCT GGCGCTATCT GGGCATCGGG
GAGGAAGAAG CCCGTGCCTT TTCCCGCGAG GTTGAAGCGG CATGGAAAGA GTTTGCCGAG
GATGACTGCT GCTGCATTGA CGTTGAGCGA AAACGCACGT TTACCATGAT GATTCGGGAA
GGTGTGGCCA TGCACGCCTT TAACGGTGAA CTGTTCGTTC AGGCCACCTG GGATACCAGT
TCGTCGCGGC TTTTCCGGAC ACAGTTCCGG ATGGTCAGCC CGAAGCGCAT CAGCAACCCG
AACAATACCG GCGACAGCCG GAACTGCCGT GCCGGTGTGC AGATTAATGA CAGCGGTGCG
GCGCTGGGAT ATTACGTCAG CGAGGACGGG TATCCTGGCT GGATGCCGCA GAAATGGACA
TGGATACCCC GTGAGTTACC CGGCGGGCGC GCCTCGTTCA TTCACGTTTT TGAACCCGTG
GAGGACGGGC AGACTCGCGG TGCAAATGTG TTTTACAGCG TGATGGAGCA GATGAAGATG
CTCGACACGC TGCAGAACAC GCAGCTGCAG AGCGCCATTG TGAAGGCGAT GTATGCCGCC
ACCATTGAGA GTGAGCTGGA TACGCAGTCA GCGATGGATT TTATTCTGGG CGCGAACAGT
CAGGAGCAGC GGGAAAGGCT GACCGGCTGG ATTGGTGAAA TTGCCGCGTA TTACGCCGCA
GCGCCGGTCC GGCTGGGAGG CGCAAAAGTA CCGCACCTGA TGCCGGGTGA CTCACTGAAC
CTGCAGACGG CTCAGGATAC GGATAACGGC TACTCCGTGT TTGAGCAGTC ACTGCTGCGG
TATATCGCTG CCGGGCTGGG TGTCTCGTAT GAGCAGCTTT CCCGGAATTA CGCCCAGATG
AGCTACTCCA CGGCACGGGC CAGTGCGAAC GAGTCGTGGG CGTACTTTAT GGGGCGGCGA
AAATTCGTCG CATCCCGTCA GGCGAGCCAG ATGTTTCTGT GCTGGCTGGA AGAGGCCATC
GTTCGCCGCG TGGTGACGTT ACCTTCAAAA GCGCGCTTCA GTTTTCAGGA AGCCCGCAGT
GCCTGGGGGA ACTGCGACTG GATAGGCTCC GGTCGTATGG CCATCGATGG TCTGAAAGAA
GTTCAGGAAG CGGTGATGCT GATAGAAGCC GGACTGAGTA CCTACGAGAA AGAGTGCGCA
AAACGCGGTG ACGACTATCA GGAAATTTTT GCCCAGCAGG TCCGTGAAAC GATGGAGCGC
CGTGCAGCCG GTCTTAAACC GCCCGCCTGG GCGGCTGCAG CATTTGAATC CGGGCTGCGA
CAATCAACAG AGGAGGAGAA GAGTGACAGC AGAGCTGCGT AA
 
Protein sequence
MKTPTIPTLL GPDGMTSLRE YAGYHGGGSG FGGQLRSWNP PSESVDAALL PNFTRGNARA 
DDLVRNNGYA ANAIQLHQDH IVGSFFRLSH RPSWRYLGIG EEEARAFSRE VEAAWKEFAE
DDCCCIDVER KRTFTMMIRE GVAMHAFNGE LFVQATWDTS SSRLFRTQFR MVSPKRISNP
NNTGDSRNCR AGVQINDSGA ALGYYVSEDG YPGWMPQKWT WIPRELPGGR ASFIHVFEPV
EDGQTRGANV FYSVMEQMKM LDTLQNTQLQ SAIVKAMYAA TIESELDTQS AMDFILGANS
QEQRERLTGW IGEIAAYYAA APVRLGGAKV PHLMPGDSLN LQTAQDTDNG YSVFEQSLLR
YIAAGLGVSY EQLSRNYAQM SYSTARASAN ESWAYFMGRR KFVASRQASQ MFLCWLEEAI
VRRVVTLPSK ARFSFQEARS AWGNCDWIGS GRMAIDGLKE VQEAVMLIEA GLSTYEKECA
KRGDDYQEIF AQQVRETMER RAAGLKPPAW AAAAFESGLR QSTEEEKSDS RAA