Gene B21_01007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01007 
SymboltorA 
ID8112827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1062954 
End bp1065500 
Gene Length2547 bp 
Protein Length848 aa 
Translation table11 
GC content55% 
IMG OID644847269 
Producthypothetical protein 
Protein accessionYP_002998842 
Protein GI251784538 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR00509] molybdopterin guanine dinucleotide-containing S/N-oxide reductases
[TIGR02164] trimethylamine-N-oxide reductase TorA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAATA ACGATCTCTT TCAGGCATCA CGTCGGCGTT TTCTGGCACA ACTCGGCGGC 
TTAACCGTCG CCGGGATGCT GGGGCCGTCA TTGTTAACGC CGCGACGTGC GACTGCGGCG
CAAGCGGCGA CTGACGCTGT CATCTCGAAA GAGGGCATTC TTACCGGGTC GCACTGGGGG
GCTATCCGCG CGACGGTGAA GGATGGTCGC TTTGTGGCGG CAAAACCGTT CGAACTGGAT
AAATATCCGT CGAAAATGAT TGCCGGATTG CCGGATCACG TACACAACGC GGCGCGTATT
CGTTATCCGA TGGTACGCGT GGACTGGCTG CGTAAGCGCC ATCTCAGCGA TACCTCCCAG
CGCGGTGATA ACCGTTTTGT GCGCGTGAGC TGGGATGAAG CCCTCGACAT GTTCTATGAA
GAACTGGAAC GCGTGCAGAA AACTCACGGG CCGAGTGCCT TGCTGACCGC CAGTGGTTGG
CAATCGACGG GGATGTTCCA TAACGCTTCG GGGATGCTGG CGAAAGCTAT TGCCTTGCAT
GGTAATAGCG TTGGTACGGG CGGAGATTAC TCTACCGGTG CTGCGCAAGT GATCCTGCCG
CGCGTAGTCG GTTCGATGGA AGTGTATGAA CAGCAAACCT CCTGGCCGCT GGTATTGCAG
AACAGCAAAA CCATTGTGCT GTGGGGCTCC GATTTGCTGA AAAACCAGCA AGCGAACTGG
TGGTGCCCGG ATCACGATGT TTATGAATAT TACGCGCAGC TAAAAGCGAA AGTCGCCGCC
GGTGAAATTG AGGTCATCAG CATCGATCCG GTTGTCACAT CCACCCATGA GTATCTGGGG
CGCGAGCATG TGAAGCACAT TGCGGTTAAC CCGCAAACTG ACGTGCCGCT GCAACTGGCG
CTGGCACATA CGCTGTACAG TGAAAACCTG TACGACAAAA ACTTCCTTGC TAACTACTGT
GTGGGTTTTG AGCAGTTCCT GCCGTATCTG CTGGGTGAGA AAGACGGTCA GCCGAAAGAT
GCCGCATGGG CTGAAAAACT GACCGGCATT GATGCCGAAA CCATTCGTGG GCTGGCGCGG
CAGATGGCGG CGAACAGAAC GCAAATTATT GCTGGCTGGT GCGTGCAGCG TATGCAGCAC
GGTGAACAGT GGGCGTGGAT GATTGTGGTT CTGGCGGCGA TGCTGGGGCA AATTGGCCTG
CCAGGTGGTG GTTTTGGTTT TGGCTGGCAC TACAACGGCG CAGGCACGCC GGGGCGTAAA
GGCGTTATTC TGAGTGGTTT CTCCGGCTCT ACGTCGATTC CGCCTGTTCA CGACAACAGT
GACTACAAAG GCTACAGCAG CACTATTCCG ATTGCCCGTT TTATCGATGC GATCCTCGAA
CCGGGGAAAG TGATCAACTG GAACGGTAAA TCGGTAAAAC TGCCGCCGCT GAAAATGTGT
ATTTTTGCCG GAACTAACCC ATTCCATCGC CATCAGCAGA TCAACCGCAT TATTGAAGGC
TTGCGCAAGC TGGAAACGGT TATCGCCATA GATAACCAGT GGACCTCAAC CTGCCGCTTT
GCCGATATCG TACTGCCTGC GACCACGCAG TTTGAGCGTA ACGATCTCGA CCAGTACGGC
AATCACTCCA ACCGTGGCAT TATCGCCATG AAACAGGTGG TGCCGCCGCA GTTCGAGGCG
CGCAACGACT TCGATATTTT CCGCGAGCTG TGCCGTCGCT TTAATCGCGA AGAAGCCTTT
ACCGAAGGGC TGGACGAAAT GGGCTGGCTG AAATGCATCT GGCAGGAAGG TGTACAGCAA
GGCAAAGGAC GCGGCGTTCA TCTGCCAGCG TTTGATGACT TCTGGAATAA CAAAGAGTAC
GTCGAGTTTG ACCATCCGCA GATGTTTGTT CGCCACCAGG CATTCCGCGA AGATCCGGAT
CTCGAACCGC TGGGCACGCC GAGTGGCCTG ATTGAGATCT ACTCGAAAAC TATCGCCGAT
ATGAACTACG ACGATTGTCA GGGGCATCCG ATGTGGTTTG AGAAAATCGA ACGCTCCCAC
GGTGGGCCTG GCTCGCAAAA GTATCCGTTG CATCTGCAAT CTGTGCATCC GGATTTCCGA
CTTCACTCGC AGTTATGTGA GTCGGAAACG CTGCGTCAGC AATATACGGT AGCGGGTAAA
GAGCCAGTAT TCATTAACCC GCAGGATGCC AGCGCGCGCG GTATTCGTAA CGGTGATGTG
GTACGCGTCT TTAACGCTCG CGGTCAGGTG TTGGCAGGGG CAGTGGTTTC TGACCGCTAT
GCACCCGGCG TGGCACGAAT TCACGAAGGG GCATGGTACG ATCCAGATAA AGGCGGCGAG
CCTGGTGCGC TGTGCAAATA CGGTAACCCC AACGTGTTGA CCATCGACAT CGGTACATCG
CAGCTGGCGC AGGCGACCAG TGCGCACACT ACGCTGGTGG AAATTGAGAA GTACAACGGA
ACAGTGGAGC AGGTGACGGC GTTTAACGGC CCCGTGGAGA TGGTGGCGCA GTGCGAATAT
GTTCCCGCGT CGCAGGTGAA ATCATGA
 
Protein sequence
MNNNDLFQAS RRRFLAQLGG LTVAGMLGPS LLTPRRATAA QAATDAVISK EGILTGSHWG 
AIRATVKDGR FVAAKPFELD KYPSKMIAGL PDHVHNAARI RYPMVRVDWL RKRHLSDTSQ
RGDNRFVRVS WDEALDMFYE ELERVQKTHG PSALLTASGW QSTGMFHNAS GMLAKAIALH
GNSVGTGGDY STGAAQVILP RVVGSMEVYE QQTSWPLVLQ NSKTIVLWGS DLLKNQQANW
WCPDHDVYEY YAQLKAKVAA GEIEVISIDP VVTSTHEYLG REHVKHIAVN PQTDVPLQLA
LAHTLYSENL YDKNFLANYC VGFEQFLPYL LGEKDGQPKD AAWAEKLTGI DAETIRGLAR
QMAANRTQII AGWCVQRMQH GEQWAWMIVV LAAMLGQIGL PGGGFGFGWH YNGAGTPGRK
GVILSGFSGS TSIPPVHDNS DYKGYSSTIP IARFIDAILE PGKVINWNGK SVKLPPLKMC
IFAGTNPFHR HQQINRIIEG LRKLETVIAI DNQWTSTCRF ADIVLPATTQ FERNDLDQYG
NHSNRGIIAM KQVVPPQFEA RNDFDIFREL CRRFNREEAF TEGLDEMGWL KCIWQEGVQQ
GKGRGVHLPA FDDFWNNKEY VEFDHPQMFV RHQAFREDPD LEPLGTPSGL IEIYSKTIAD
MNYDDCQGHP MWFEKIERSH GGPGSQKYPL HLQSVHPDFR LHSQLCESET LRQQYTVAGK
EPVFINPQDA SARGIRNGDV VRVFNARGQV LAGAVVSDRY APGVARIHEG AWYDPDKGGE
PGALCKYGNP NVLTIDIGTS QLAQATSAHT TLVEIEKYNG TVEQVTAFNG PVEMVAQCEY
VPASQVKS