Gene Nmul_A2752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2752 
Symbol 
ID3785723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp3154617 
End bp3155975 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content56% 
IMG OID637812843 
Productacetyl-CoA carboxylase, biotin carboxylase 
Protein accessionYP_413431 
Protein GI82703865 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGAAA AAATACTGAT AGCAAACCGT GGCGAGATTG CCTTGCGGAT ACAGCGTGCA 
TGCCGTGAGA TGGGTATCAA GACTGTAGCG GTGCACTCCC AGGCCGATGC CGAGGCAAAA
TATGTAAAAC TCGCTGATGA GTCCGTATGC ATCGGACCTG CTCCTTCGGC ACAAAGCTAC
CTCAATATTC CCGCCATCAT CAGCGCCGCA GAGGTCACCG ATGCGGAAGC CATTCATCCG
GGTTACGGGT TTCTGTCGGA AAATGCCGAT TTTGCCGAAC GTGTGGAAAA AAGCGGCTTC
GTCTTTATCG GCCCGCGGCC CGAGACGATT CGGCTGATGG GCGACAAGGT CAGTGCCAAG
AACGCCATGA AAAAGGCGGG CGTGCCTTGT GTCCCGGGCT CGGACGGCGG TCTCCCCGAA
TCGGTGGACG AAATCAAAGC CATCGCTCGT GCCATCGGCT ACCCTATCAT TATCAAGGCC
GCGGCAGGTG GGGGCGGCCG TGGCATGCGC GTGGTGCACA CTGAAGCAGC GTTATCGAAC
GCGGTGATCA TTACGCGCAA CGAAGCTCAG GCCGCCTTCG GCAACCCGAC GGTCTACGCC
GAAAAATATC TGGAGAACCC GCGCCACATC GAGTTTCAGG TATTGGCGGA TGAGCATCGC
AACGCCATCT ATCTGGGGGA ACGGGATTGC TCCACGCAGC GCCGCCATCA GAAAATTATC
GAGGAAGCAC CTGCCCTGGG CATTCCCCCC AGATTGCGCG ACAGAATGGG AACCCGTTGC
GTCGACGCAT GCAAGCGCAT CGGCTATCGC GGTGTGGGTA CATTCGAGTT TCTGTTCGAA
AAAAACGAGT TCTATTTCAT TGAGATGAAT ACGCGCCTGC AGGTCGAGCA TACCATTACC
GAGGCCATCA CCGGGATCGA TCTGGTGCAA GCGCAGATCC GCGTGGCAGC CGGCGAAAAG
CTCACTCTGC GGCAGCGCGA CATCGTGTTG AAGGGTCATG CCATCGAATG CCGTATCGCG
GCCGAAGACC CGTACAAGTT CACCCCGTCG GCAGGGCGTA TCACCCAGTA TCATGCTCCC
GGTGGCCCTG GCATTCGTGT GGATTCCCAT ATATATCATA ACTACTTCGT GCCGCCCTAT
TACGATTCCA TGATAGGCAA AATAATCGCG TATGGAGACA ACCGCGAGCA GGCGATAGCA
CGAATGCGCA TTGCGCTGTC GGAAATGGTG ATCGGCGGCA TCAAAACCAA TACGCCGCTC
CACCTTGACC TGTTGTCCGA CGCCGCTTTT CTGAATGGCT GCACGAGCAT TCATTATCTG
GAGCAAAAGC TTGCCAATTA TAATAACAAT TCCGGCTGA
 
Protein sequence
MFEKILIANR GEIALRIQRA CREMGIKTVA VHSQADAEAK YVKLADESVC IGPAPSAQSY 
LNIPAIISAA EVTDAEAIHP GYGFLSENAD FAERVEKSGF VFIGPRPETI RLMGDKVSAK
NAMKKAGVPC VPGSDGGLPE SVDEIKAIAR AIGYPIIIKA AAGGGGRGMR VVHTEAALSN
AVIITRNEAQ AAFGNPTVYA EKYLENPRHI EFQVLADEHR NAIYLGERDC STQRRHQKII
EEAPALGIPP RLRDRMGTRC VDACKRIGYR GVGTFEFLFE KNEFYFIEMN TRLQVEHTIT
EAITGIDLVQ AQIRVAAGEK LTLRQRDIVL KGHAIECRIA AEDPYKFTPS AGRITQYHAP
GGPGIRVDSH IYHNYFVPPY YDSMIGKIIA YGDNREQAIA RMRIALSEMV IGGIKTNTPL
HLDLLSDAAF LNGCTSIHYL EQKLANYNNN SG