Gene ECD_01020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01020 
SymbolycdO 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1086526 
End bp1087653 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content51% 
IMG OID 
Producthypothetical protein 
Protein accessionACT42915 
Protein GI253977245 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.154697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATTA ACTTCCGCCG TAACGCATTG CAGTTGAGCG TGGCTGCGCT GTTTTCTTCT 
GCTTTTATGG CTAACGCCGC TGATGTGCCG CAGGTCAAAG TGACCGTGAC GGATAAGCAG
TGCGAACCGA TGACCATTAC GGTTAACGCC GGGAAAACAC AGTTCATTAT TCAGAACCAC
AGCCAGAAGG CGCTGGAGTG GGAGATCCTC AAAGGCGTGA TGGTGGTGGA AGAGCGGGAA
AATATCGCCC CTGGCTTTAG CCAGAAAATG ACGGCGAATT TACAGCCTGG CGAATACGAT
ATGACCTGCG GTCTGCTGAC TAACCCGAAA GGGAAGTTGA TCGTCAAAGG TGAGGCAACG
GCGGATGCGG CGCAAAGTGA TGCGCTGTTA AGTCTTGGTG GTGCAATTAC TGCATATAAA
GCGTATGTCA TGGCGGAAAC CACGCAGCTG GTGACCGACA CCAAAGCCTT TACCGACGCG
ATTAAAGCAG GCGATATCGA AAAAGCGAAA GCACTGTATG CACCGACGCG CCAGCACTAT
GAGCGTATTG AACCGATTGC TGAACTGTTC TCCGATCTGG ATGGCAGCAT TGACGCCCGT
GAAGATGATT ACGAGCAAAA AGCCGCCGAC CCAAAATTCA CTGGTTTCCA CCGTCTGGAA
AAAGCATTGT TTGGCGACAA CACCACCAAA GGGATGGATC AGTACGCTGA GCAGCTTTAT
ACCGATGTGG TCGATTTGCA AAAACGCATC AGTGAACTGG CTTTCCCACC TTCAAAAGTG
GTCGGCGGCG CAGCCGGACT GATTGAGGAA GTGGCAGCCA GCAAAATTAG CGGTGAAGAA
GATCGCTACA GCCACACCGA TCTGTGGGAT TTCCAGGCTA ACGTTGAAGG CTCGCAGAAA
ATTGTCGATT TGCTGCGTCC ACAACTGCAA AAAGCCAACC CGGAACTGCT GGCAAAAGTC
GATGCCAACT TTAAAAAGGT CGATACCATT CTGGCGAAAT ACCGTACTAA AGACGGTTTT
GAAACCTACG ACAAATTGAC CGATGCCGAC CGGAATGCAC TGAAAGGACC GATTACTGCG
CTGGCGGAAG ATCTGGCGCA ACTTCGCGGT GTGCTGGGAC TGGATTAA
 
Protein sequence
MTINFRRNAL QLSVAALFSS AFMANAADVP QVKVTVTDKQ CEPMTITVNA GKTQFIIQNH 
SQKALEWEIL KGVMVVEERE NIAPGFSQKM TANLQPGEYD MTCGLLTNPK GKLIVKGEAT
ADAAQSDALL SLGGAITAYK AYVMAETTQL VTDTKAFTDA IKAGDIEKAK ALYAPTRQHY
ERIEPIAELF SDLDGSIDAR EDDYEQKAAD PKFTGFHRLE KALFGDNTTK GMDQYAEQLY
TDVVDLQKRI SELAFPPSKV VGGAAGLIEE VAASKISGEE DRYSHTDLWD FQANVEGSQK
IVDLLRPQLQ KANPELLAKV DANFKKVDTI LAKYRTKDGF ETYDKLTDAD RNALKGPITA
LAEDLAQLRG VLGLD