Gene B21_02821 details


Gene Information

Locus tag: B21_02821
Symbol: hybA
ID: 8113208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3010792
End bp: 3011778
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 55%
IMG OID: 644849010
Product: hypothetical protein
Protein accession: YP_003000583
Protein GI: 251786279
COG category: [C] Energy production and conversion
COG ID: [COG0437] Fe-S-cluster-containing hydrogenase components 1
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
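
The length fields in this record agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic, with variable names chosen only for illustration:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3010792
end_bp = 3011778

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons; the last codon is the stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 987 0 328
```

Under that reading, the computed values match the Gene Length (987 bp) and Protein Length (328 aa) fields.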


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACAGAC GTAATTTTAT TAAAGCAGCC TCCTGCGGGG CATTGCTGAC GGGCGCGCTG 
CCGTCTGTCA GTCATGCGGC TGCTGAAAAC CGCCCGCCAA TTCCGGGATC GCTGGGGATG
TTGTACGACT CGACCTTGTG CGTAGGCTGC CAGGCTTGCG TCACCAAGTG TCAGGATATC
AATTTCCCTG AACGTAACCC GCAAGGGGAA CAGACCTGGT CGAACAACGA CAAACTGTCG
CCGTATACCA ATAACATCAT TCAGGTGTGG ACCAGCGGCA CAGGGGTCAA CAAAGACCAG
GAGGAGAACG GCTACGCGTA CATTAAGAAA CAGTGTATGC ACTGCGTCGA TCCGAACTGT
GTCTCTGTGT GCCCGGTCTC TGCACTGAAA AAAGATCCGA AAACCGGCAT TGTCCATTAC
GACAAAGATG TGTGCACCGG CTGCCGTTAC TGCATGGTCG CCTGTCCGTA CAACGTGCCG
AAGTACGACT ACAACAACCC GTTTGGTGCG CTGCATAAGT GCGAGCTGTG CAACCAGAAA
GGTGTGGAAC GTCTCGATAA AGGCGGTCTA CCTGGCTGCG TAGAAGTGTG CCCGGCGGGC
GCGGTGATTT TCGGTACGCG TGAAGAGCTG ATGGCGGAGG CGAAAAAACG TCTGGCGCTG
AAGCCTGGCA GCGAATACCA CTATCCGCGT CAGACGCTGA AATCTGGCGA CACTTACCTG
CATACGGTGC CGAAATATTA TCCGCATCTG TACGGCGAGA AAGAGGGCGG CGGTACTCAG
GTTCTGGTAC TGACGGGTGT GCCTTATGAA AATCTCGACC TGCCGAAACT GGACGATCTT
TCTACCGGTG CGCGTTCCGA AAATATTCAA CACACCCTGT ATAAAGGCAT GATGCTACCA
CTGGCTGTGC TGGCGGGCTT AACCGTGCTG GTTCGTCGCA ACACCAAAAA CGACCATCAC
GACGGAGGAG ACGATCATGA GTCATGA
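
The GC content field is the percentage of G and C bases in the gene sequence. As a hedged sketch of that calculation, the hypothetical helper `gc_percent` below strips the spacing used in the sequence block and counts G and C. It is applied only to the first 60-base line of the gene sequence as a sample. The whole-gene value is not recomputed here.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the Gene sequence block, copied with its original spacing.
first_line = (
    "GTGAACAGAC GTAATTTTAT TAAAGCAGCC TCCTGCGGGG CATTGCTGAC GGGCGCGCTG"
)
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to the same function should give a figure comparable to the GC content field of the record.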
 
Protein sequence
MNRRNFIKAA SCGALLTGAL PSVSHAAAEN RPPIPGSLGM LYDSTLCVGC QACVTKCQDI 
NFPERNPQGE QTWSNNDKLS PYTNNIIQVW TSGTGVNKDQ EENGYAYIKK QCMHCVDPNC
VSVCPVSALK KDPKTGIVHY DKDVCTGCRY CMVACPYNVP KYDYNNPFGA LHKCELCNQK
GVERLDKGGL PGCVEVCPAG AVIFGTREEL MAEAKKRLAL KPGSEYHYPR QTLKSGDTYL
HTVPKYYPHL YGEKEGGGTQ VLVLTGVPYE NLDLPKLDDL STGARSENIQ HTLYKGMMLP
LAVLAGLTVL VRRNTKNDHH DGGDDHES
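
The protein sequence corresponds to the gene sequence translated with translation table 11, in which an alternative start codon such as GTG is read as methionine when it initiates the CDS. The sketch below illustrates this on the first 30 and last 27 bases of the gene sequence. The function name `translate_cds` and the `is_start` parameter are hypothetical. The start-codon set reflects table 11 as commonly documented and should be treated as an assumption.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate in frame 1; stop at the first stop codon.

    If is_start is True and the first codon is an allowed start codon,
    it is translated as M regardless of its usual meaning.
    Raises KeyError on codons with characters other than A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_30 = "GTGAACAGAC GTAATTTTAT TAAAGCAGCC"   # start of the gene sequence
last_27 = "GACGGAGGAG ACGATCATGA GTCATGA"       # end of the gene sequence
print(translate_cds(first_30))                  # prints: MNRRNFIKAA
print(translate_cds(last_27, is_start=False))   # prints: DGGDDHES
```

Both fragments agree with the corresponding ends of the protein sequence in this record: GTG at the start yields M, and the terminal TGA is a stop codon that adds no residue.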