Gene B21_02171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02171 
SymbolnuoC 
ID8113026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2288617 
End bp2290419 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content56% 
IMG OID644848377 
Producthypothetical protein 
Protein accessionYP_002999950 
Protein GI251785646 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01961] NADH (or F420H2) dehydrogenase, subunit C
[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGAACA ATATGACCGA CTTAACCGCG CAAGAACCCG CCTGGCAGAC CCGCGATCAT 
CTTGATGATC CGGTGATTGG CGAACTGCGC AACCGTTTTG GGCCGGATGC CTTTACTGTT
CAGGCGACTC GCACCGGGGT TCCCGTTGTG TGGATCAAGC GTGAACAATT ACTGGAAGTT
GGCGATTTCT TAAAGAAACT GCCGAAACCT TACGTCATGC TGTTTGACTT ACACGGCATG
GACGAACGTC TGCGCACACA CCGCGAAGGG TTACCTGCCG CGGATTTTTC CGTTTTCTAC
CATCTGATTT CTATCGATCG TAACCGCGAC ATCATGCTGA AGGTGGCGCT GGCAGAAAAC
GACCTGCACG TACCGACCTT CACCAAACTG TTCCCGAACG CTAACTGGTA TGAGCGTGAA
ACCTGGGATC TGTTTGGCAT TACTTTCGAC GGTCACCCGA ACCTGCGACG CATCATGATG
CCGCAAACCT GGAAAGGTCA CCCGCTGCGT AAAGATTATC CGGCGCGCGC TACCGAATTC
TCGCCGTTTG AGCTGACCAA AGCCAAACAG GATCTGGAGA TGGAAGCCCT GACCTTCAAA
CCGGAAGAGT GGGGGATGAA GCGCGGCACC GAAAACGAGG ACTTCATGTT CCTCAACCTC
GGTCCGAACC ACCCGTCGGC GCACGGGGCT TTCCGTATCG TTTTGCAACT CGATGGCGAA
GAGATTGTCG ACTGCGTACC AGACATCGGT TACCACCACC GTGGTGCGGA GAAAATGGGC
GAACGCCAGT CCTGGCACAG CTACATTCCG TATACTGACC GTATCGAATA CCTCGGCGGC
TGCGTTAACG AAATGCCTTA CGTGCTGGCG GTAGAGAAAC TGGCCGGGAT CACCGTGCCG
GATCGCGTTA ACGTCATTCG CGTTATGCTC TCCGAACTGT TCCGCATCAA CAGTCACCTG
CTGTATATCT CGACCTTTAT TCAGGACGTC GGCGCAATGA CGCCAGTGTT CTTCGCCTTT
ACCGATCGTC AGAAAATTTA CGATCTGGTG GAAGCGATCA CGGGTTTCCG TATGCACCCG
GCGTGGTTCC GTATTGGCGG CGTAGCGCAC GACCTGCCGC GCGGCTGGGA TCGCCTGCTG
CGTGAGTTCC TCGACTGGAT GCCGAAACGT CTGGCGTCTT ACGAGAAAGC GGCGCTGCAA
AACACCATTC TGAAAGGTCG TTCCCAGGGC GTTGCCGCCT ATGGCGCGAA AGAGGCGCTG
GAGTGGGGCA CCACTGGCGC GGGCCTGCGT GCTACCGGGA TCGACTTCGA CGTGCGTAAG
GCGCGTCCTT ATTCTGGCTA TGAAAACTTC GACTTTGAAA TCCCGGTGGG TGGTGGCGTT
TCTGACTGCT ACACCCGCGT AATGCTTAAA GTGGAAGAGC TGCGCCAGAG TCTGCGCATT
CTTGAGCAGT GCCTCAACAA CATGCCGGAA GGCCCGTTCA AAGCGGATCA CCCGCTGACC
ACGCCGCCGC CGAAAGAGCG CACGCTGCAA CATATCGAAA CCCTGATCAC CCACTTCCTG
CAAGTGTCGT GGGGGCCGGT GATGCCTGCC AATGAATCTT TCCAGATGAT TGAGGCGACC
AAGGGGATCA ACAGTTACTA CCTGACCAGC GACGGTAGCA CCATGAGTTA TCGCACTCGT
ATCCGCACGC CGAGTTATGC GCATTTGCAG CAAATTCCGG CGGCGATCCG CGGCAGCCTG
GTGTCTGACC TGATTGTTTA TCTGGGCAGT ATCGATTTTG TTATGTCAGA TGTGGACCGC
TAA
 
Protein sequence
MVNNMTDLTA QEPAWQTRDH LDDPVIGELR NRFGPDAFTV QATRTGVPVV WIKREQLLEV 
GDFLKKLPKP YVMLFDLHGM DERLRTHREG LPAADFSVFY HLISIDRNRD IMLKVALAEN
DLHVPTFTKL FPNANWYERE TWDLFGITFD GHPNLRRIMM PQTWKGHPLR KDYPARATEF
SPFELTKAKQ DLEMEALTFK PEEWGMKRGT ENEDFMFLNL GPNHPSAHGA FRIVLQLDGE
EIVDCVPDIG YHHRGAEKMG ERQSWHSYIP YTDRIEYLGG CVNEMPYVLA VEKLAGITVP
DRVNVIRVML SELFRINSHL LYISTFIQDV GAMTPVFFAF TDRQKIYDLV EAITGFRMHP
AWFRIGGVAH DLPRGWDRLL REFLDWMPKR LASYEKAALQ NTILKGRSQG VAAYGAKEAL
EWGTTGAGLR ATGIDFDVRK ARPYSGYENF DFEIPVGGGV SDCYTRVMLK VEELRQSLRI
LEQCLNNMPE GPFKADHPLT TPPPKERTLQ HIETLITHFL QVSWGPVMPA NESFQMIEAT
KGINSYYLTS DGSTMSYRTR IRTPSYAHLQ QIPAAIRGSL VSDLIVYLGS IDFVMSDVDR