Gene Nmul_A2082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2082 
Symbol 
ID3786086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2374239 
End bp2375228 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content53% 
IMG OID637812171 
Productbiotin--acetyl-CoA-carboxylase ligase 
Protein accessionYP_412768 
Protein GI82703202 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0340] Biotin-(acetyl-CoA carboxylase) ligase 
TIGRFAM ID[TIGR00121] birA, biotin-[acetyl-CoA-carboxylase] ligase region
[TIGR00122] BirA biotin operon repressor domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAT TCACTTTTTC CATTCTCCGG GTGCTGAGCG ATAACGAATT TCATTCAGGG 
CAGGCTATCG CCGAAGCTCT GGGAGTTTCC CGCGCCAGTG TATCGAATGC TCTTCGCGAC
GCGGATGAAG CTGGATTGAC CATTCATAAA ATCAAGGGAC GCGGCTATCG CCTGCTCGAC
CAGGTGCAAT GGCTGGAACG AAATGCAATT CTTGAGCACC TCGGTCATCA GGCGGACAAA
TTCAATCTGG AAATACTCGA TACGATCGAT TCCACCAACA GCCTTTTACT GCATGAGGCG
GATAACCGGT TGAGCCTGCG TGATGGGCTC ATTCATGTGG TAGCGGCCGA GCTGCAAACG
AAGGGGCGTG GACGACGAGG ACGGCAATGG CACTCGGGCC TGGGAGTCGG TCTCGCGTTT
TCCGTGCTAT GGCGGTTTCA GCAAAGTGCA AGCTTTCTTT CGGGTTTGAG TCTCGCCACA
GGCGTTGCAA TAGTACGTGC GCTCGAATCT TCAGGGATAC AAGGGGCCGT ACTCAAATGG
CCAAACGATG TGATGTTCAA TTTCTGTAAA CTGGCAGGTA TATTGATAGA ACTGCATGGC
GATATGCTCG GTCCCACCGT TGCTGTAATC GGTGTAGGCA TGAACCTGAA ATTGTCCGAC
AGCGTTCAGG CGCGGATAGA CCAGGGGGCA ACGGATATTT TTTCCATCAG TGGAGAAACA
CCGGATCGCA ATAAATTGCT GGCTGAATTG TTGCTGAATA TTGCTCGAGT ATTGAGAGAA
TTTGAGCAGT CGGGTTTTAC GCCATTCAAG GAGGAATGGG TGGATCGCCA TGTATGTGAA
GGCAAAGCCG TCACCCTCAA GCTACCTGAC GGGTCGGGCC AGGAAGGACT GGTGCACGGG
GTATCGGATA GCGGGTCGCT GTTGCTGCAA ACGTCACTGG GTCTTCGCAG TTTCAGCGGC
GGCGAGATAT CGCTGCGCAG GACAGCATAA
 
Protein sequence
MNPFTFSILR VLSDNEFHSG QAIAEALGVS RASVSNALRD ADEAGLTIHK IKGRGYRLLD 
QVQWLERNAI LEHLGHQADK FNLEILDTID STNSLLLHEA DNRLSLRDGL IHVVAAELQT
KGRGRRGRQW HSGLGVGLAF SVLWRFQQSA SFLSGLSLAT GVAIVRALES SGIQGAVLKW
PNDVMFNFCK LAGILIELHG DMLGPTVAVI GVGMNLKLSD SVQARIDQGA TDIFSISGET
PDRNKLLAEL LLNIARVLRE FEQSGFTPFK EEWVDRHVCE GKAVTLKLPD GSGQEGLVHG
VSDSGSLLLQ TSLGLRSFSG GEISLRRTA