Gene Nmag_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1074 
Symbol 
ID8823905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1096949 
End bp1099954 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content64% 
IMG OID 
Productprotein of unknown function DUF1508 
Protein accessionYP_003479220 
Protein GI289580754 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGTTC TGCCAGGTGG CCGGAGGTTT ATTATGATGG GTTGCAGTTC TCCGCTCACA 
TTTCGAATGT CTTCACCAAG TGACGTTCAC CACAAACTGT ACCGGCTGTA CGAACGCTAC
GTCGGCGAAC CCGACTCGAC GAAGGATGTC TACGGCTACT GGCTGTTCAT CGTCGGCTAC
GTCGTTGCTG CGGCGGCCGT ATTCACGTTC GTTGCCGGCT ATGCGGGGGA TGCAGATACG
TACGGGCTAA TCAGGGCCTC GGGTGTGACC GCGGCGACAG GGCTTGCACT CTGCCTGTTC
GGGATCGTTC TCATGCTTCC AGTTCGAAGA CGAGGAATTC AGGCGAGTGT GTTGGGACTG
CTGATTTCGT TTGGTGGCGT TGCGTTCTTC GCGTGGGCGT ATCCGTACAA TTGGCGAGAA
CTCGGCACGG ACTACAGCGT TCACGTCATC CTCGTCTACA CGGTCGGGAT TGGGATCATC
GCGGGTGTTA CTGCGCTCGT TCCCGTTCTG ACCGGCCGGA AAGGGATGTT CGTCGAGGAG
GAAGGAGAAA CGGAAGATCC GCCGATTCTC ACCGGAGATG CACTGGAGGG TGCACAGTTC
GCTGTCTTCC GTGACGACAA CGGCGACTGG AAGTGGAACG TTCTCCATCT CGAAGCGCTG
GCGACGAGCA ACGAGAGCGC CGTGACTCGA CCGAAGGCCA CCGAGGGAAT TGAACGCGTC
CAGTCCCAGA TCAGTTCAGC TGGACTGATG GAGCTCACCA CGTCCGCGTT CCGGCTCTAC
GAGGACAGAG ATGGGAGCTG GCAGTGGACG CTCGCCCGAG ACGACGGCAG CGTCGTCGGC
ACCTGTGCTG GCGAGTTTAG CGAGCGCGAT GGGGCTGAGG AGTCCGTGAG CTTCCTCAAA
GATCGTGGAC CAACGGCAGA CGTGATCGAA ATCGACGGCG CGGCGTTCAC GTACGCCGAA
GAACGCGACC AGTGGCACTG GCAACTGGTG GACGACGAGC GGTTGCCGCT GGCTTCGGGT
GCGAACGGCC ACGGCACCCA GGAGAACGCC GAGACGGCCG CACGCACGTT CGCCGAGCGG
TTCGACCAGG CACGCGTACT CGACCTCGAA CACGTTGGTG CCGAACTCTA CGACCGAACG
GACGACAGCG GCGCGAACGG CTGGTCCTGG CGCTTCGTCG ACGAACAGGA TTCACCGCTT
GCCGCCGCAA CCGACGCGTA CGACGCCCGG CGCGACGCAG AGGAAGCTGC GGATGCACTG
CTTTCGGAAC TCGGCAGTGC GTCGGTGACG GTGGCTGGCG AACCAACCTA CGAACGCTAC
CAGACCGGCG ACCAGTGGCG CTGGCGGCTG GTCGGCGAGT CCGAACACGT TGTCGCCCAA
AGTCCAAGCG ACGCCGAAAC CGAGGCCGAC GCGACTCACG AGACCGACAC CTTCGGAGCA
CACGCCCGCG ACGCCGACGT CGTCGAAATC GAGGACGCGG AGTACGAGGT CTATCCGACC
GACAGCCAGG AACTAACCTA CGAGGAGGGC GACGCACTGC CTGCAACGTC CGACGAGCAG
CAGATGGTGT CGACCGACGG CGGCACGGCG ACGGCCGAGG GGGAGGACGG CGCAGACGAC
GGCCGCTCCT GGCACTGGCG TCTCGTCACC GAAGACCGCG ACGTGATCGC CGGAAGCACC
GAACCCCACT ACGACGCCGA GACGGCGACC GAAGCGATCC AGCGCGTTCG CGAGCAAGCG
AGCGAAGCCG AACTCATCGA GTTCGAGGAG GCTGCCTTCC AGGTCTACGA AGCCGATGAC
GGCGAGTGGC GCTGGCGGCT CATCGACGAG GACGGCAACG TCCTCGCAGA CAGCGGTGCA
GAACACACCT CCCGCGGCGA GGCCGCAGAA GCGATGATGA CGCTCAAAGA GCAGGCGCCG
GACGCCGAAC TGCTCGAAAT CGAAACGGCA GCCTTCGAGC TCTTCGTCAA CGAGGACAAC
GAATGGGGCT GGCGACTTAT CGACGAAGCC GGTCAGCTCG TCGCCGAAGA TCCGTCGACG
CACCCAACCC GCGGTGCCGC GCGCAAGGCG ATGAACCGAC TCCTCGAGTA CCTCGACTCT
GACGTGCGGA CCATGGAAGA TGCGATCTTC CAGCCGTACG CAGCGGACGA CTGGCACTGG
CGGTTCGTCC TGCCAACCGG GGAAACGGTC GCCGTTGCCG GTGACACCTA CGCGACACGC
GACGAACTCG TCGATGCCAT CCCTGCCGTT CGCGACGCAG CCGAATCCGC ACAGGACTAC
ACGATCGGCA ACGTCACGAT CCAGCTCTAC CGCAGCGGTG ATTGGAGCTT CCGACTCCTC
GACCGCGATC GCAAGGAGAT TGCCGACGCG ACTGACACCT ACGCGGAACG CGACGCCGCA
CTCGAGATCG TCGAAGATCT CAAAGCACAC GCCGACGATG CCCCGATCTT CACGATCGAG
GACGCCGCGA TCCGCGTCAC TGACGCTGAC ACGGACGACG GCTGGACATG GGACCTCGTC
GACCGCGAGC GCACCGTCCT CGCAAGCGCC GTCGACACGG TGGCGAGCCG CGAGGAACTT
CACGAGGAGA TCGAAACTGT CCGCCAGCTC GCACCGATGG CCGGCCGTGT CGACTTCGAC
GTTGCCTCGT TCGAACTCGT CGCCGACGAG GACGACCGCT GGCAGTGGCG GCTCATCGAC
GAGGACGGCC ACACGGTCGC CACCGGCTCC GAATCACACG AATCGAGCGA GGCCGCTCGT
GAGGCACTCG AGAACGTCCG CGAACTGATC GACGCAGCGA GCATCCTCGA GATCGACAGC
GTCTCCTTCG AACTCCATAC CGCGGAGGAC GAGAACGAGG ATGGCTGGGT CTGGCGGCTG
GTCGACGAGT ACGGCTCGAC GATGGCCCAG AGCACGCAGG TTTACGAGTC CCGGACGGAC
GCCCGTGAGG CGATGAACAA CGTGAAAGCG GAAGCCCCAG AGGGCTGGAT CACGTTCACG
GAGTAA
 
Protein sequence
MSVLPGGRRF IMMGCSSPLT FRMSSPSDVH HKLYRLYERY VGEPDSTKDV YGYWLFIVGY 
VVAAAAVFTF VAGYAGDADT YGLIRASGVT AATGLALCLF GIVLMLPVRR RGIQASVLGL
LISFGGVAFF AWAYPYNWRE LGTDYSVHVI LVYTVGIGII AGVTALVPVL TGRKGMFVEE
EGETEDPPIL TGDALEGAQF AVFRDDNGDW KWNVLHLEAL ATSNESAVTR PKATEGIERV
QSQISSAGLM ELTTSAFRLY EDRDGSWQWT LARDDGSVVG TCAGEFSERD GAEESVSFLK
DRGPTADVIE IDGAAFTYAE ERDQWHWQLV DDERLPLASG ANGHGTQENA ETAARTFAER
FDQARVLDLE HVGAELYDRT DDSGANGWSW RFVDEQDSPL AAATDAYDAR RDAEEAADAL
LSELGSASVT VAGEPTYERY QTGDQWRWRL VGESEHVVAQ SPSDAETEAD ATHETDTFGA
HARDADVVEI EDAEYEVYPT DSQELTYEEG DALPATSDEQ QMVSTDGGTA TAEGEDGADD
GRSWHWRLVT EDRDVIAGST EPHYDAETAT EAIQRVREQA SEAELIEFEE AAFQVYEADD
GEWRWRLIDE DGNVLADSGA EHTSRGEAAE AMMTLKEQAP DAELLEIETA AFELFVNEDN
EWGWRLIDEA GQLVAEDPST HPTRGAARKA MNRLLEYLDS DVRTMEDAIF QPYAADDWHW
RFVLPTGETV AVAGDTYATR DELVDAIPAV RDAAESAQDY TIGNVTIQLY RSGDWSFRLL
DRDRKEIADA TDTYAERDAA LEIVEDLKAH ADDAPIFTIE DAAIRVTDAD TDDGWTWDLV
DRERTVLASA VDTVASREEL HEEIETVRQL APMAGRVDFD VASFELVADE DDRWQWRLID
EDGHTVATGS ESHESSEAAR EALENVRELI DAASILEIDS VSFELHTAED ENEDGWVWRL
VDEYGSTMAQ STQVYESRTD AREAMNNVKA EAPEGWITFT E