Gene B21_00989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00989 
SymbolappB 
ID8115401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1043795 
End bp1044931 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID644847251 
Producthypothetical protein 
Protein accessionYP_002998824 
Protein GI251784520 
COG category[C] Energy production and conversion 
COG ID[COG1294] Cytochrome bd-type quinol oxidase, subunit 2 
TIGRFAM ID[TIGR00203] cytochrome d oxidase, subunit II (cydB) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.787651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGATT ATGAAACATT GCGCTTCATC TGGTGGCTGC TGATTGGCGT GATCCTGGTG 
GTCTTTATGA TCTCCGACGG ATTTGACATG GGGATCGGCT GTCTGCTGCC GCTGGTGGCG
CGTAATGATG ATGAACGCCG GATAGTGATA AACAGCGTTG GTGCACACTG GGAAGGCAAC
CAGGTCTGGT TGATCCTCGC TGGTGGGGCA TTATTTGCCG CCTGGCCCAG AGTGTATGCA
GCGGCGTTTT CCGGCTTTTA TGTGGCGATG ATCCTGGTGC TGTGCTCACT GTTCTTCCGC
CCGCTGGCCT TTGATTATCG CGGAAAAATC GCCGATGCAC GCTGGCGTAA AATGTGGGAC
GCCGGTCTGG TCATCGGCAG TCTGGTGCCG CCGGTAGTCT TCGGTATCGC CTTCGGCAAC
TTGTTGCTCG GCGTGCCGTT TGCCTTCACA CCGCAATTAC GCGTGGAGTA TCTCGGCAGC
TTCTGGCAAC TGCTGACGCC ATTCCCTTTA TTGTGCGGAT TGCTCAGCCT TGGGATGGTG
ATTTTGCAAG GTGGCGTCTG GTTACAACTG AAAACTGTTG GTGTGATTCA TCTGCGTTCA
CAGCTGGCGA CCAAACGCGC TGCACTGTTG GTGATGCTGT GCTTTTTGCT GGCGGGTTAC
TGGCTGTGGG TCGGTATTGA TGGCTTTGTA CTGCTCGCCC AGGATGCTAA CGGTCCTTCC
AATCCGTTAA TGAAACTGGT GGCAGTGCTA CCTGGTGCCT GGATGAATAA TTTTGTCGAG
TCGCCCGTTT TGTGGATCTT CCCGCTGCTG GGATTCTTCT GCCCATTGCT GACGGTGATG
GCGATTTATC GTGGTCGCCC GGGTTGGGGA TTTTTGATGG CATCATTGAT GCAATTTGGC
GTGATTTTCA CGGCAGGCAT CACGCTGTTC CCCTTTGTCA TGCCGTCAAG CGTGAGTCCG
ATCTCCAGCC TGACGTTGTG GGACAGTACT TCCAGTCAGC TGACGCTGAG CATTATGTTG
GTAATCGTGC TGATATTTTT GCCCATTGTG TTGCTCTACA CTCTCTGGAG CTACTACAAA
ATGTGGGGGC GCATGACAAC AGAAACTCTC CGCCGTAACG AAAACGAGTT GTACTAA
 
Protein sequence
MFDYETLRFI WWLLIGVILV VFMISDGFDM GIGCLLPLVA RNDDERRIVI NSVGAHWEGN 
QVWLILAGGA LFAAWPRVYA AAFSGFYVAM ILVLCSLFFR PLAFDYRGKI ADARWRKMWD
AGLVIGSLVP PVVFGIAFGN LLLGVPFAFT PQLRVEYLGS FWQLLTPFPL LCGLLSLGMV
ILQGGVWLQL KTVGVIHLRS QLATKRAALL VMLCFLLAGY WLWVGIDGFV LLAQDANGPS
NPLMKLVAVL PGAWMNNFVE SPVLWIFPLL GFFCPLLTVM AIYRGRPGWG FLMASLMQFG
VIFTAGITLF PFVMPSSVSP ISSLTLWDST SSQLTLSIML VIVLIFLPIV LLYTLWSYYK
MWGRMTTETL RRNENELY