Gene B21_01831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01831 
Symbolunknown 
ID8113743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1899811 
End bp1900839 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content50% 
IMG OID644848051 
Producthypothetical protein 
Protein accessionYP_002999624 
Protein GI251785320 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATCACCAGG ATACTAATCG TCTGATTAAA GCATGGCAGA AACCGGAGAT GATCGTCGTT 
TCTGAATGCT ACTGGACCGC AGCTGCTAAA CATGCAGATA TCGTATTACC GATCACCACA
TCGTTTGAGC GCAATGACTT GACGATGACC GGTGATTACA GTAACCAGCA TATTGTGCCG
ATGAAGCAGG CTGTCGCTCC GCAATTTGAA GCGCGAAACG ATTTTGATGT GTTTGCCGAT
CTTGCGGAAT TACTGAAACC TGGCGGAAAA GAGATCTATA CCGAAGGTAA AGATGAAATG
GCGTGGCTGA AATTTTTCTA TGATGCCGCT CAGAAAGGTG CCCGTGCGCA ACGCGTGACT
ATGCCGATGT TTAATGCCTT CTGGCAGCAA AATAAACTGA TCGAAATGCG CCGCAGCGAG
AAGAACGAAC AGTACGTTCG TTATGGTGAT TTCCGCGCCG ATCCGGTGAA AAATGCGCTG
GGTACGCCAA GCGGCAAAAT TGAGATTTAC TCCAAAACGC TGGAAAAATT TGGCTATAAG
GATTGCCCGG CACACCCAAC CTGGCTTGCG CCTGATGAGT GGAAGGGTAC CGCCGACGAG
AAGCAGTTGC AGCTTCTGAC CGCACATCCG GCACACCGTT TACATAGTCA GCTTAACTAT
GCGGAACTGC GTAAAAAATA TGCGGTTGCA GATCGTGAAC CAATCACTAT TCACACCGAA
GATGCTGCTC GCTTTGGTAT TGCGAATGGT GATCTGGTGC GTGTGTGGAA CAAACGTGGT
CAGATTCTGA CAGGCGCGGT GGTGACTGAC GGGATCAAAA AAGGCGTGGT ATGCGTGCAT
GAAGGTGCAT GGCCAGATCT GGAAAATGGC TTGTGTAAAA ACGGCAGTGC GAACGTGTTA
ACGGCGGATA TCCCCAGCTC GCAGCTGGCA AATGCCTGTG CCGGTAACTC TGCGCTGGTG
TATATCGAAA AATATACGGG CAATGCGCCG AAGTTAACGG CGTTTGATCA GCCAGCTATT
CAGGCATAA
 
Protein sequence
HHQDTNRLIK AWQKPEMIVV SECYWTAAAK HADIVLPITT SFERNDLTMT GDYSNQHIVP 
MKQAVAPQFE ARNDFDVFAD LAELLKPGGK EIYTEGKDEM AWLKFFYDAA QKGARAQRVT
MPMFNAFWQQ NKLIEMRRSE KNEQYVRYGD FRADPVKNAL GTPSGKIEIY SKTLEKFGYK
DCPAHPTWLA PDEWKGTADE KQLQLLTAHP AHRLHSQLNY AELRKKYAVA DREPITIHTE
DAARFGIANG DLVRVWNKRG QILTGAVVTD GIKKGVVCVH EGAWPDLENG LCKNGSANVL
TADIPSSQLA NACAGNSALV YIEKYTGNAP KLTAFDQPAI QA