Gene B21_02021 details


Gene Information

Locus tag: B21_02021
Symbol: dld
ID: 8113055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2116402
End bp: 2118117
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 52%
IMG OID: 644848233
Product: hypothetical protein
Protein accession: YP_002999806
Protein GI: 251785502
COG category: [C] Energy production and conversion
COG ID: [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID:
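
The coordinate and length fields in the Gene Information table are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates (consistent with the listed values, although the record does not state the convention) and a CDS that ends with a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2116402
END_BP = 2118117


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1716 571"
```

Both results agree with the Gene Length (1716 bp) and Protein Length (571 aa) fields.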


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTCCA TGACAACAAC TGATAATAAA GCCTTTTTGA ATGAACTTGC TCGTCTGGTG 
GGTTCTTCAC ACCTGCTCAC CGATCCCGCA AAAACGGCCC GCTATCGCAA GGGCTTCCGT
TCTGGTCAGG GCGACGCGCT GGCTGTCGTT TTCCCTGGCT CACTACTAGA ATTGTGGCGG
GTGCTGAAAG CCTGCGTCAC CGCCGACAAA ATTATTCTGA TGCAGGCCGC CAATACAGGC
CTGACCGAAG GATCGACGCC AAACGGTAAC GATTATGATC GCGATGTCGT TATCATCAGC
ACCCTGCGTC TCGACAAGCT GCACGTTCTT GGCAAGGGCG AACAGGTGCT GGCCTATCCG
GGCACCACGC TCTATTCGCT GGAAAAAGCC CTCAAACCGC TGGGACGCGA ACCGCACTCA
GTGATTGGAT CATCGTGTAT AGGCGCATCG GTCATCGGCG GTATTTGTAA CAACTCCGGC
GGCTCGCTGG TGCAACGTGG CCCGGCGTAT ACCGAAATGT CGTTATTCGC GCGTATAAAT
GAAGACGGCA AACTGACGCT GGTGAACCAT CTGGGGATTG ATCTGGGCGA AACGCCGGAG
CAGATCCTTA GCAAGCTGGA TGATGATCGC ATCAAAGATG ACGATGTGCG TCACGATGGT
CGTCACGCCC ACGATTATGA CTATGTCCAC CGCGTTCGTG ATATTGAAGC CGACACGCCC
GCACGTTATA ACGCCGATCC TGATCGGTTA TTTGAATCTT CTGGTTGCGC CGGGAAGCTG
GCGGTCTTTG CAGTACGTCT TGATACCTTC GAAGCGGAAA AAAATCAGCA GGTGTTTTAT
ATCGGCACCA ACCAGCCGGA AGTGCTGACC GAAATCCGCC GTCATATTCT GGCTAACTTC
GAAAATCTGC CGGTTGCCGG GGAATATATG CACCGGGATA TCTACGATAT TGCGGAAAAA
TACGGCAAAG ACACCTTCCT GATGATTGAT AAGTTAGGCA CCGACAAGAT GCCGTTCTTC
TTTAATCTCA AGGGACGCAC CGATGCGATG CTGGAGAAAG TGAAATTCTT CCGTCCGCAT
TTTACTGACC GTGCGATGCA AAAATTCGGT CACCTGTTCC CCAGCCATTT ACCGCCGCGC
ATGAAAAACT GGCGCGATAA ATACGAGCAT CATCTGCTGT TAAAAATGGC GGGCGATGGC
GTGGGCGAAG CCAAATCGTG GCTGGTGGAT TATTTCAAAC AGGCCGAAGG CGATTTCTTT
GTCTGTACGC CGGAGGAAGG CAGCAAAGCG TTTTTACACC GTTTCGCCGC TGCGGGCGCA
GCAATTCGTT ATCAGGCGGT GCATTCCGAT GAAGTCGAAG ACATTCTGGC GTTGGATATC
GCTCTGCGGC GTAACGACAC CGAGTGGTAT GAGCATTTAC CGCCGGAGAT CGACAGCCAG
CTGGTGCACA AGCTCTATTA CGGCCATTTT ATGTGCTATG TCTTCCATCA GGATTACATA
GTGAAAAAAG GCGTGGATGT GCATGCGTTA AAAGAACAGA TGCTGGAACT GCTACAGCAG
CGCGGCGCGC AGTACCCTGC CGAGCATAAC GTCGGTCATT TGTATAAAGC ACCGGAGACG
TTGCAGAAGT TCTATCGCGA GAACGATCCG ACCAACAGCA TGAATCCGGG GATCGGTAAA
ACCAGTAAAC GGAAAAACTG GCAGGAAGTG GAGTAA
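
The gene sequence is displayed in blocks of 10 bases, 60 bases per line. The sketch below shows one way to strip that layout and compute a GC percentage for comparison with the GC content field. `clean_sequence` and `gc_percent` are hypothetical helper names, and how the database rounds its reported value is not stated in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as a short demonstration.
first_line = "ATGTCTTCCA TGACAACAAC TGATAATAAA GCCTTTTTGA ATGAACTTGC TCGTCTGGTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1716 bp sequence through the same two functions gives the whole-gene value.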
 
Protein sequence
MSSMTTTDNK AFLNELARLV GSSHLLTDPA KTARYRKGFR SGQGDALAVV FPGSLLELWR 
VLKACVTADK IILMQAANTG LTEGSTPNGN DYDRDVVIIS TLRLDKLHVL GKGEQVLAYP
GTTLYSLEKA LKPLGREPHS VIGSSCIGAS VIGGICNNSG GSLVQRGPAY TEMSLFARIN
EDGKLTLVNH LGIDLGETPE QILSKLDDDR IKDDDVRHDG RHAHDYDYVH RVRDIEADTP
ARYNADPDRL FESSGCAGKL AVFAVRLDTF EAEKNQQVFY IGTNQPEVLT EIRRHILANF
ENLPVAGEYM HRDIYDIAEK YGKDTFLMID KLGTDKMPFF FNLKGRTDAM LEKVKFFRPH
FTDRAMQKFG HLFPSHLPPR MKNWRDKYEH HLLLKMAGDG VGEAKSWLVD YFKQAEGDFF
VCTPEEGSKA FLHRFAAAGA AIRYQAVHSD EVEDILALDI ALRRNDTEWY EHLPPEIDSQ
LVHKLYYGHF MCYVFHQDYI VKKGVDVHAL KEQMLELLQQ RGAQYPAEHN VGHLYKAPET
LQKFYRENDP TNSMNPGIGK TSKRKNWQEV E
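
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows, assuming the amino-acid assignments of table 11 (which match the standard code for internal codons). It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG, and it raises `KeyError` on any base other than A, C, G, or T. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTCTTCCA TGACAACAAC TGATAATAAA"))  # prints "MSSMTTTDNK"
```

The output matches the first ten residues of the protein sequence.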