Gene Nmag_4133 details


Gene Information

Locus tag: Nmag_4133
Symbol:
ID: 8828867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013924
Strand:
Start bp: 181073
End bp: 182347
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 61%
IMG OID:
Product: cobalamin synthesis protein P47K
Protein accession: YP_003482214
Protein GI: 289937612
COG category:
COG ID:
TIGRFAM ID:
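
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions, not statements made by the record. The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 181073
end_bp = 182347
gene_length_bp = 1275
protein_length_aa = 424

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
total_codons = span_bp // 3
coding_codons = total_codons - 1

coordinates_consistent = span_bp == gene_length_bp
protein_consistent = span_bp % 3 == 0 and coding_codons == protein_length_aa

print(span_bp, coding_codons, coordinates_consistent, protein_consistent)
```

Under these assumptions the span and the codon count agree with the listed gene length and protein length.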


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGAGC AGACGATTCC CGTGACGGTG CTTTCCGGGA CGCTTGGCGC CGGCAAGACA 
ACCACGCTCA ACAATCTCCT TCGTGAGAGC GGCGACCGCG AACTCGCCGT GCTCGTCAAC
GACATGGGTG AGGTGAACGT CGACGCGGAC CTCGTCGCCG AGTCCTCAGA CATCTCGGCG
GACGAAGAGG AGATAGTCGA GCTCTCGAAC GGCTGTATCT GCTGTGAGCT TCGTGGCGAC
CTGCTCGACG CGATCGGTGG ACTCACCCGT GAGCGGGAGT TCGACGCGAT CGTCGTCGAG
TCGACAGGCG TCGCCGAACC ACTCCCTGTT GCCCAGACGC TGACGCTCGG GTTCGATCAG
TCGGACCTCG ATCCCACGGA GTTCTACGAT GAAACGGGTA TTGAGCCGCT CGAGAACTGT
CACCTCGACA CGACGGTGAC AGTCGTCGAT GCCCATCAGT TTCACGAGGC GATGCAATCC
GACGAGATTC TCGACGACGA CGGGACGAAA AAACACCTGG GCGACCTGCT CGTCGAACAG
GTGGAGTTCT GTGACGTCTT GCTTTTGAAC AAGTGCGACC TCGTCAACGA GGAGACGCTC
AGCGAGATTG AAGAGACCCT CGAGATGCTC CAGCCTCGTG CCGAGATCGT CCGGACAACC
CACGGTCGAG TCGACGTCGA CGAAGTGGTA GACACGGGAC GGTTCGACTT CGAGGAAGCC
AGCCAATCGG CGGGTTGGAT GCGGGAACTC CAGGAACCCC ACGAGTCCGC CGAAGAAGAA
CACGGCGTAA CCTCGTTCGT CTTCGAGGCG CGACGCCCCT TCCACCCCGA GCGGTTCGCC
GAACTGCTCG ACGTGTTTCC GGAGAACGTC GTCCGATCGA AAGGTCACTT CTGGCTCGCG
GGGCGCGAGG AGATGGCGCT CATGTTGAAC GTCGCTGGCC AGTCGATTCG GGTTGCACCC
GCCGGGAACT GGATCGCCAC CCTTCCGTCC GAAGAACGCG AGGAACAGTT TGAGGCGTAC
CCCGAACTCG AGGAGACCTG GGACGACGAG TGGGGTGACC GCGGCAGCCA GTTGGTGTTG
ATCGGCACCG AGATGGACCA CGACTCGATC CGTGAACACC TCGAACTCTG TCTGCTCACC
GACGAGGAGA TGGACGCTGA CTGGGACACA TTCGACGATC GGTTCCCAAC GTTTGAACCA
CCCGAAGGTA CCGACGAAGA CGAAGCAGAG GCTCCTGACC ACAATGGCCA AGAAGAGATC
GGAATCGCAG ATTGA
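
The record lists a GC content of 61% for this gene. A value of that kind can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch; the function name `gc_fraction` is hypothetical, and the demonstration uses only the first 10-base block, not the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First block of the gene sequence, used here only as a small demonstration.
first_block = "ATGGAAGAGC"
print(gc_fraction(first_block))
```

Passing the complete gene sequence, with or without the spaces and line breaks, gives the fraction for the whole CDS, which can then be compared with the reported percentage.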
 
Protein sequence
MEEQTIPVTV LSGTLGAGKT TTLNNLLRES GDRELAVLVN DMGEVNVDAD LVAESSDISA 
DEEEIVELSN GCICCELRGD LLDAIGGLTR EREFDAIVVE STGVAEPLPV AQTLTLGFDQ
SDLDPTEFYD ETGIEPLENC HLDTTVTVVD AHQFHEAMQS DEILDDDGTK KHLGDLLVEQ
VEFCDVLLLN KCDLVNEETL SEIEETLEML QPRAEIVRTT HGRVDVDEVV DTGRFDFEEA
SQSAGWMREL QEPHESAEEE HGVTSFVFEA RRPFHPERFA ELLDVFPENV VRSKGHFWLA
GREEMALMLN VAGQSIRVAP AGNWIATLPS EEREEQFEAY PELEETWDDE WGDRGSQLVL
IGTEMDHDSI REHLELCLLT DEEMDADWDT FDDRFPTFEP PEGTDEDEAE APDHNGQEEI
GIAD
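
The protein sequence can be related back to the gene sequence by codon translation. The sketch below is illustrative and assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons. It does not model the alternative start codons of table 11, which is not a concern here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is dropped.
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE.get(bases[i:i + 3], "X")  # X marks unknown codons
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence block, up to the third 10-base group.
print(translate("ATGGAAGAGC AGACGATTCC CGTGACGGTG"))
```

The first 30 bases translate to MEEQTIPVTV, matching the start of the protein sequence, and the final line of the gene sequence translates to GIAD followed by a TGA stop codon.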