Gene B21_03065 details

Gene Information

Locus tag: B21_03065
Symbol: accC
ID: 8115311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3266283
End bp: 3267632
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 53%
IMG OID: 644849248
Product: hypothetical protein
Protein accession: YP_003000821
Protein GI: 251786517
COG category: [I] Lipid transport and metabolism
COG ID: [COG0439] Biotin carboxylase
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
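
The coordinate and length fields in this record agree with each other, which a little arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based inclusive coordinates and that the CDS ends in a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3266283, 3267632)
print(length)                     # 1350
print(protein_length_aa(length))  # 449
```

Both results match the Gene Length and Protein Length fields of the record.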


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.669753
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGATA AAATTGTTAT TGCCAACCGC GGCGAGATTG CATTGCGTAT TCTTCGTGCC 
TGTAAAGAAC TGGGCATCAA GACTGTCGCT GTGCACTCCA GCGCGGATCG CGATCTAAAA
CACGTATTAC TGGCAGATGA AACGGTCTGT ATTGGCCCTG CTCCGTCAGT AAAAAGTTAT
CTGAACATCC CGGCAATCAT CAGCGCCGCT GAAATCACCG GCGCAGTAGC AATCCATCCG
GGTTACGGCT TCCTCTCCGA GAACGCCAAC TTTGCCGAGC AGGTTGAACG CTCCGGCTTT
ATCTTCATTG GCCCGAAAGC AGAAACCATT CGCCTGATGG GCGACAAAGT ATCCGCAATC
GCGGCGATGA AAAAAGCGGG CGTCCCTTGC GTACCGGGTT CTGACGGCCC GCTGGGCGAC
GATATGGATA AAAACCGTGC CATTGCTAAA CGCATTGGTT ATCCGGTGAT TATCAAAGCC
TCCGGCGGCG GCGGCGGTCG CGGTATGCGC GTAGTGCGCG GCGACGCTGA ACTGGCACAA
TCCATCTCCA TGACCCGTGC GGAAGCGAAA GCTGCTTTCA GCAACGATAT GGTTTACATG
GAGAAATACC TGGAAAATCC TCGCCACGTC GAGATTCAGG TACTGGCTGA CGGTCAGGGC
AACGCTATCT ATCTGGCGGA ACGTGACTGC TCCATGCAAC GCCGCCACCA GAAAGTGGTC
GAAGAAGCGC CAGCACCGGG CATTACCCCG GAACTGCGTC GCTACATCGG CGAACGTTGC
GCTAAAGCGT GTGTTGATAT CGGCTATCGC GGTGCAGGTA CTTTCGAGTT CCTGTTCGAA
AACGGCGAGT TCTATTTCAT CGAAATGAAC ACCCGTATTC AGGTAGAACA CCCGGTTACA
GAAATGATCA CCGGCGTTGA CCTGATCAAA GAACAGCTGC GTATCGCTGC CGGTCAACCG
CTGTCGATCA AGCAAGAAGA AGTTCACGTT CGCGGCCATG CGGTGGAATG TCGTATCAAC
GCCGAAGATC CGAACACCTT CCTGCCAAGT CCGGGCAAAA TCACCCGTTT CCACGCACCT
GGCGGTTTTG GCGTACGTTG GGAGTCTCAT ATCTACGCGG GCTACACCGT ACCGCCGTAC
TATGACTCAA TGATCGGTAA GCTGATTTGC TACGGTGAAA ACCGTGACGT GGCGATTGCC
CGCATGAAGA ATGCGCTGCA GGAGCTGATC ATCGACGGTA TCAAAACCAA CGTTGATCTG
CAGATCCGCA TCATGAATGA CGAGAACTTC CAGCATGGTG GCACTAACAT CCACTATCTG
GAGAAAAAAC TCGGTCTTCA GGAAAAATAA
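
The gene sequence is printed in space-separated blocks of ten bases, so the grouping has to be removed before the GC content reported in the record can be recomputed. The following is a minimal sketch. The helper names are hypothetical, and the example uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "ATGCTGGATA AAATTGTTAT"
print(gc_percent(first_blocks))  # 25.0
```

Passing the complete gene sequence to `gc_percent` gives the value to compare against the GC content field.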
 
Protein sequence
MLDKIVIANR GEIALRILRA CKELGIKTVA VHSSADRDLK HVLLADETVC IGPAPSVKSY 
LNIPAIISAA EITGAVAIHP GYGFLSENAN FAEQVERSGF IFIGPKAETI RLMGDKVSAI
AAMKKAGVPC VPGSDGPLGD DMDKNRAIAK RIGYPVIIKA SGGGGGRGMR VVRGDAELAQ
SISMTRAEAK AAFSNDMVYM EKYLENPRHV EIQVLADGQG NAIYLAERDC SMQRRHQKVV
EEAPAPGITP ELRRYIGERC AKACVDIGYR GAGTFEFLFE NGEFYFIEMN TRIQVEHPVT
EMITGVDLIK EQLRIAAGQP LSIKQEEVHV RGHAVECRIN AEDPNTFLPS PGKITRFHAP
GGFGVRWESH IYAGYTVPPY YDSMIGKLIC YGENRDVAIA RMKNALQELI IDGIKTNVDL
QIRIMNDENF QHGGTNIHYL EKKLGLQEK
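
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI ordering of bases (T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in the start codons it permits. This sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The function and constant names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order: TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring block spacing and dropping a trailing stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence, copied from the record.
print(translate_cds("ATGCTGGATA AAATTGTTAT TGCCAACCGC"))  # MLDKIVIANR
```

The output matches the first block of the protein sequence. Translating the full gene sequence the same way allows the whole 449-residue protein to be compared.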