Gene B21_01833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01833 
SymboltorY 
ID8112833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1902264 
End bp1903364 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content48% 
IMG OID644848052 
Producthypothetical protein 
Protein accessionYP_002999625 
Protein GI251785321 
COG category[C] Energy production and conversion 
COG ID[COG3005] Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAGGGA AAAAACGCAT TGGGTTATTG TTTTTGCTGA TAGCGGTTGT GGTTGGTGGC 
GGCGGGTTAT TGCTGGCGCA AAAAGCCTTA CATAAAACGT CGGATACAGC ATTTTGCCTT
TCCTGCCACT CGATGAGTAA ACCTTTTGAG GAATATCAGG GAACTGTCCA CTTTTCGAAC
CAGAAAGGGA TACGTGCGGA ATGTGCCGAT TGCCATATTC CAAAGTCAGG GATGGATTAT
TTATTTGCTA AATTAAAAGC ATCTAAAGAT ATTTATCATG AATTTGTTAG CGGCAAAATA
GACAGTGACG ATATGTTCGA AACTCATCGC CAGGAAATGG CCGAAACAGT ATGGAAAGAA
TTAAAAGCAA CTGACTCTGC AACGTGCCGT AGTTGCCATT CTTTTGATGC CATGGATATT
GCCTCGCAAA GTGAATCTGC GCAGAAAATG CATAACAAAG CACAAAAGGG CGGCGAAACC
TGTATCGATT GTCATAAAGG CATTGCCCAT TTTCCGCCAG AAATAAAAAT GGATGACAAC
GCGGCGCATG AGCTGGAAAG TCAGACCGCT ACTTCAGTGA CTAATGGCGC ACATATTTAT
CCTTTCAAAA CTTCTCGCAT AGGCGAGCTG GCTACCGTGA ATCCTGGTAC CGATCTCACC
GTCGTTGATG CCAGTGGCAA ACAGCCGATC GTTCTGTTGC AGGGTTATCA AATGCAGGGC
AGTGAAAACA CGCTCTACCT GGCGGCAGGT CAACGGCTGG CGCTAGCCAC ATTAAGTGAA
GAAGGTATCA AGGCGCTCAC GGTAAACGGG GAATGGCAGG CTGACGAATA CGGCAATCAA
TGGCGTCAGG CGTCTTTACA GGGTGCGCTT ACCGATCCCG CATTAGCGGA CCGTAAACCG
CTATGGCAAT ACGCTGAAAA ACTTGACGAT ACCTATTGCG CTGGTTGTCA TGCCCCTATT
GCCGCCGACC ATTACACCGT CAATGCGTGG CCGTCCATTG CCAAAGGAAT GGGGGCACGA
ACCAGCATGA GCGAAAACGA ACTGGACATT TTAACGCGGT ATTTCCAGTA CAACGCCAAA
GATATTACCG AGAAACAGTG A
 
Protein sequence
MRGKKRIGLL FLLIAVVVGG GGLLLAQKAL HKTSDTAFCL SCHSMSKPFE EYQGTVHFSN 
QKGIRAECAD CHIPKSGMDY LFAKLKASKD IYHEFVSGKI DSDDMFETHR QEMAETVWKE
LKATDSATCR SCHSFDAMDI ASQSESAQKM HNKAQKGGET CIDCHKGIAH FPPEIKMDDN
AAHELESQTA TSVTNGAHIY PFKTSRIGEL ATVNPGTDLT VVDASGKQPI VLLQGYQMQG
SENTLYLAAG QRLALATLSE EGIKALTVNG EWQADEYGNQ WRQASLQGAL TDPALADRKP
LWQYAEKLDD TYCAGCHAPI AADHYTVNAW PSIAKGMGAR TSMSENELDI LTRYFQYNAK
DITEKQ