Gene Nmar_0821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0821 
Symbol 
ID5773808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp725326 
End bp726306 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content35% 
IMG OID641316459 
Productbiotin synthase 
Protein accessionYP_001582155 
Protein GI161528329 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0695199 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACTC TAGAGTTCAT CAAAGAGTGT CAAGAAAAAG TATTTTCAGG AAATCACATT 
ACTGCTGAAG ATGCTGAAAA ATTACTAAAC ATTCCAGAGG AGAATCTGAA GGATTTGGCA
AGATGTGCAA ATGAGATAAC TCGAGATTTT AATGGAGAAA AAGTAGACGT TGAACAACTA
AACAACATAA AGAAAAATGC ATGTAGTGAA GACTGTACAT TTTGCGGACA GTCTGCATTC
TTTGATACAG GTATAGAGAC ATACCAACTA CCATCACCTG AAGAAGTAGT GTCAAAGGCT
CAAAAAGCAA AAGAAGAAGG TGCAGAGTCA TATTGTCTAG TTGCAGCATG GAGAGAACCA
TCAAGAACAG ATTTTGAAAA AGTTTGCAAA ATTATTACTG AAATTAATGA TAAAGTTGGA
ATAAGTGTTG AATGTAGTCT AGGATTCCTT ACACAAGAAC AAGCAAAAAA ACTCAAAGAT
CTCAAAGTAA AAAGATACAA CCATAATTTA GAGACAGCAA AATCAAAATT TCCAGAAATA
TGTACAACTC ACACATATGA AGACAGACTA GAAACACTAG GAATAGCAAG AGATGCAGGA
TTGGAGTTAT GTACTGGTGG AATTATCGGA TTAGGTGAAA CAAGAGAACA GAGATTAGAA
TTAACATTAG AGTTAGCAAG ATTGTACCCT GAAGAAGTAA CAATCAACAT TTTGGTACCA
GTACCAGGAA CTCCATTGGA ATTACAAACA GATTTGCCAA ATTCTGAAAT TGTCAGAATG
TTTTCAGTTA TCCGATTTTT ACTTCCAGAG TCAGTCATTA AAATCTCAGG AGGAAGAGAA
ACCAACCTAG AGGATTCAGG CGAGGAATTA CTTCAAAGTG GAGCAAATGG AATCATTACC
TCAGGATACC TTACTATGGG GGGCAATGAA GCTCAAAAAG ACCATGCAAT GATTGAAAAG
ATTGGTCTTA AATCACAATA A
 
Protein sequence
MSTLEFIKEC QEKVFSGNHI TAEDAEKLLN IPEENLKDLA RCANEITRDF NGEKVDVEQL 
NNIKKNACSE DCTFCGQSAF FDTGIETYQL PSPEEVVSKA QKAKEEGAES YCLVAAWREP
SRTDFEKVCK IITEINDKVG ISVECSLGFL TQEQAKKLKD LKVKRYNHNL ETAKSKFPEI
CTTHTYEDRL ETLGIARDAG LELCTGGIIG LGETREQRLE LTLELARLYP EEVTINILVP
VPGTPLELQT DLPNSEIVRM FSVIRFLLPE SVIKISGGRE TNLEDSGEEL LQSGANGIIT
SGYLTMGGNE AQKDHAMIEK IGLKSQ