Gene Nmag_3068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3068 
Symbol 
ID8825928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp3171556 
End bp3173247 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content67% 
IMG OID 
Productcobyric acid synthase CobQ 
Protein accessionYP_003481182 
Protein GI289582716 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCGGA CGCTTCTCGT CGCCGGAACC GCCAGCCACG TCGGCAAATC GACCGTCGTC 
GCGGGCCTCT GTCGCCTGCT CGCCGACCAT GGTGTCTCCG TCGCGCCGTT CAAGGCGCAG
AACATGAGCA ACAATGCGCG GGTCGTCGTG CGGGCGGCTG GCGTAGATGG AAAAGAAGGC
GAAGTGAAGG GGCAGGAATC GCGTCCGACT GGAGACGGTG CAACAATGAA AGCCGCCGAC
CAGTGGGGCG AAATCGGCGT CTCGCAGTTC ACCCAGGCCC GTGCGGCCCG ACTCACACCG
ACGACTGACG TAAATCCCAT CCTGTTGAAA CCGCGCGGCG ACGGCGAGAG CCAGCTCATC
GTTCAGGGCG AGGCCCGCGA GCACGTCCCC GCAGGCAGCT ACTACGAAGA ATACTGGGAC
GACGCCCGCG ACGCCGCCGC ACAGTCCTAC CAGCGACTTG CCGCTGACCA CGACGTCATC
GTCGCCGAAG GCGCAGGCAG CATCGCCGAA ATCAACCTCC ACGACCAGGA CCTCGCGAAC
GTCGAAACGG CCCGCTTCGC CGACGCAGAG ATCCTTCTGC TCGTCGACAT CGAACGCGGC
GGCGCGTTCG CCAGCCTCTA CGGGACGATC GAACTCCTCC CCGACGATAT CCGAGAGCGC
GTCGTTGGTG CGGTCATCAC CAAGTTCCGC GGTGATCCGT CGCTACTCGA GTCCGGCATC
GCTGAGATCG AATCCCGAAC CGGCGTGTCG ATTCTGGGCG TGCTGCCGTA CGACGATCCC
GGATTGCCCG AGGAGGACAG CGTTGGCCTT CCGAGTGCGG ATTCACGGGG GGTACGCGGC
GGCGATGATG GCGTCCCCGA CGAGCAGCGC CTCCGGATCG CGGTACCGCG ACTCCCGCGG
ATTTCGAACG CGACCGATCT CGAGGCACTC GCCGCAGAAC CGGGTGTTAC GGTGGTCTAT
CTCCCGGTCG ACGATTCGGG GACGAACGCC AACGCAGAGG CGGTGTCCAG CGGTTCGTTC
GCGAGCGCCG AGATCGAGAT CGACCTCGAC GCCACCGCCG ACGCCGTCGT CATCCCCGGT
ACCAAGAACA CCGTCGACGA CTGCCGTGCG CTCCACGCCG CGGGCTTCGC CGACGCACTG
CGCTCGTTCG ACGGTCCCAT CGTCGGCATC TGCGGCGGCT ACCAGCTTCT CGGCGAACGG
CTGACGAACG CCGCACTCGA GAGCACCGAC ACCAGCCAGT GCGACACCGT TCCGGGTCTC
GGTCTCCTCC CCGTCGAAAC CCGCTTCGAC GAGACGAAAC ACCTCGAACG GACGACGGTC
CCCGTCGACG GCGCGGCGAG CCCCTTGCTC GCCGGCGCAG CCGGCACCGC ATCGGGCTAC
GAGATTCACG CCGGGCGGAC GCGCATACTC GACCACGAGT CCGTCGCCAG CCCGCTCGGC
GAGTCGAGCG CGGCACACGG ACTGGTTCTC GGGACCTACC TCCACGGGCT GTTCGACAAC
GAGGGGGTTC GGACGGCGTT TCTCGAGGCA GTTGCGAGAG AGCGGGGACT CGAGTGGCCG
CCGGCGAGCC CGACTGGGGC GCAGACAGCG CGAACTGGTG GGCAGAGGGG ACAGTCGCCG
TTCGACCGGG CGGCGTCGTT GCTCCGGGAG AACGTCGATG CGGAGGTGGT TGAACGACTG
GTAAACGGGT AG
 
Protein sequence
MTRTLLVAGT ASHVGKSTVV AGLCRLLADH GVSVAPFKAQ NMSNNARVVV RAAGVDGKEG 
EVKGQESRPT GDGATMKAAD QWGEIGVSQF TQARAARLTP TTDVNPILLK PRGDGESQLI
VQGEAREHVP AGSYYEEYWD DARDAAAQSY QRLAADHDVI VAEGAGSIAE INLHDQDLAN
VETARFADAE ILLLVDIERG GAFASLYGTI ELLPDDIRER VVGAVITKFR GDPSLLESGI
AEIESRTGVS ILGVLPYDDP GLPEEDSVGL PSADSRGVRG GDDGVPDEQR LRIAVPRLPR
ISNATDLEAL AAEPGVTVVY LPVDDSGTNA NAEAVSSGSF ASAEIEIDLD ATADAVVIPG
TKNTVDDCRA LHAAGFADAL RSFDGPIVGI CGGYQLLGER LTNAALESTD TSQCDTVPGL
GLLPVETRFD ETKHLERTTV PVDGAASPLL AGAAGTASGY EIHAGRTRIL DHESVASPLG
ESSAAHGLVL GTYLHGLFDN EGVRTAFLEA VARERGLEWP PASPTGAQTA RTGGQRGQSP
FDRAASLLRE NVDAEVVERL VNG