Gene Nmag_4098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_4098 
Symbol 
ID8828832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013924 
Strand
Start bp139807 
End bp143001 
Gene Length3195 bp 
Protein Length1064 aa 
Translation table11 
GC content57% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003482182 
Protein GI289937580 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTTTCGC GCGGTATCCG TGTCGGCCTG GGTTTCCCCG ATCAACCGGG CCGACCAAGC 
CAACACCGAA GGGTCAATCT GGCTCCAGTT ACCTCGAATA AATTTTCATT CACTGCAAGT
AGAATACCAG TTTTACGTGA CATTCGTGGT AGTAGTTCTA TGTCACATCG TAGCGTTTTC
CTGGCAGCCT GTATCGGCAT ACTGGTTGTC TGCTGTGCAA TCGCACTCGT TCCCGCGGGG
GTCAGTGCAA CCGACGACGA TACACCCGAC CCTGACGAGT ACGACTCACT CGTGGACGGA
ATGGACGGAA ATGGCACGGC AGATGACCCT TTCCACGTCA CGAACGTGAC GGAGTTGCAG
GCGATGGAGG CGAACCTATC GGCGCACTAC AAACTAGTTT CGCCGATCGA CGCGAGCGAG
ACGGCCGACT GGAACGACGG TGATGGTTTC GATCCGATCG GCGCGTGCGA CTTCAACGCA
ACAATCGACG AATGTGAGGA AACCCCGTTT GAAGGGTCGT TTGACGGCGG GCTGTATCCG
ATTGCCGACC TCACGATCGA CCGTGAGAAC GAGTCTGAGG TGGGGCTTTT CGGATACGTC
TTTCCCGACG GTGAAGTTGT CGATATCAAA CTGCGTGACG CCCGCGTGAC CGGCAACGTC
GAGGTAGGTA CCGTCACGGG ACTGAACTTC GGGTCGATCG AGGGTGCGGA CGTTACCGGA
ACCACTTCAG GCGACCTCGA CGCTTTCCCG GGACGGATCG GCGGACTGGC CGGGTACAAC
GGCGGCGAAG TGCGCGACTC GTTCGTCGAC AACGATGTGA TCGGACCGAG CTTCACGGGC
GGTGCCGTCG GCTCGAACAA CGGGACCATT GTCCAGACCC ACGCGAGCGG CGATGTCGAG
GCGTTGGACA TCTCTGGGGG CCTTGTTGGT ACGAACGTCG GGGACGGCGA GGTCGGACAC
TCGACTGCCA GCGGCGACGT GAACGGCACT TTCAGTGGAT TTGGTGGATT TGTCGGTACC
CACGTGGCTG GAGAGGGGAT CATCTACGAA TCGTACGCGA CCGGTGACGT CTCCGGTGAA
ACTATCAGCG CGGCCGGCGG GTTCGCTGGA TCGAACTCCG CCCTCACCGG CGAGGCCGTA
ATCTACGACG CGTACGCGAC TGGGAACGTG AACGGCGAGG ACCGTCTTGG TGGCTTCGTC
GGCGACCTCG GATCCAGTGC GTTCGTGGAG ATGTCATACG CAACCGGATC CGTTTCCGGT
GCGCCAGACG AAGCGAGCGT TGGCGGCTTC GCCGGCAACG TCCCCGACCG AGACGCCGTC
GCGATCACTG GCTCGTACTA CGACGAGGTA ACAAGCCAGC AGGATGAGGG TATTGGCGTC
GGTGACGGTG ACGTGACCGG GCTGCCGACC CAGAACATGA CGGATGACGC GGCGGCCGAG
AATATGACCG CGTTCGACTT CGCTGCGATC TGGACGACGC TCCCCGACGA CTACCCGACG
CTGCAGGCGC TCGATCCGGA ACCGATGCCG CCAGACCCGC CGAACTTCGC CGTGACGATC
GACGAAACGA CCAGTCCCGT CACCGAGGGT TCACCCCTCA ACGTGAACAC GACCATCGAA
AACGTCGGTG AACAGGCGAC CGAGCAAACC GTGTCGCTAG AGGTTGCCGG CGACCAACGA
GACGTGACGA CGGTGGCGCT CGGCGGTGAC GAGTACGAAA CCGTGGTACT CACCTGGGAG
ACCGAGGTGA ACGCCGCCGG TGACTACGAC GCCACTGTCT CCAGTGATGA CGACTCCGAG
ACCGTGCCCA TCACCGTCGA AGAACAGCCC GACGATGCCG TCTTCGACGT GACGATCGAC
GAGACCAACA GTCCCGTCAC GGAAGGAGGC AATCTGCTGG TGAATGCTAC CGTTGAGAAC
ACCGGCAATC TTGCCGACGA ACAGGACGTC AATCTGACGA TCAACGGTGA CGAAGTGGAT
GCAACGGTCG TCAGCCTCGA CGGTGACGAA ACCGAAGAGA TAACGTTCAC CTGGGAGACC
GAGGAGAACG ACGCCGATGA CTACGACGCC ACCGTCTCCA GTGATGATAA CTCCGAGACC
GTGCCCATCA CCGTCGAAGA ACAGCCCGAC GATGCCTTCT TCGACGTGAC GATCGACGAG
ACCAGCAGTC CCGTCACTGA GGGCGAGGAA CTTGAGGTCG GTGCCACGAT TGAAAACACC
GGCGATCTTG CTGACGAACA GGACATCAAA CTCTCCATCA ACGATAGCGA GATGGACGTC
ACGACGGTGG ACCTCGACGG TAACGAGACC GAAGAGGTCA CGCTTATTTG GGAGACTGAA
AAATCCGAGG CCGGTGACTA TGTGGCCAAC ATTTCAAGTA ATAACGACCT CGACATGGCA
AATGTATCGG TCGGAGAGAA ACCGGACCCA CCAACGCCCA CTCCACCGAC ACCGGATCCT
GCGTTCTTCG ATGTGACTGT TGATAATACA ACCAGTCCCG TTACAGAGGG TGAAAAGTTA
CTAGTCAACG CCACGATCGA GAACACCGGC GACCGGTCCG ACAAACAGGA CATCAACCTA
ACAATCAACG GTAACGAGGT CAACGTCACA TCGATCGAAC TCGACAGCGA TGAAAGCGAA
AAAGTGACAC TTACGTGGGA AACTGAAAAA TCCAATACTG GGGAGTACGT CGTTACAGTT
TCAACGAAAG ATAATACTGA TATGGAAAAT GTCACCGTAA ACGCAATTAA ACCTGCATTC
TTCACCGTCG ATATCAAAGA AGTCACTGAC TCGGTTCATA TCAATGAAGA AGTGTGTGAA
AAAGCATATA TTACGAATGT CGGTGAGGAA GTAGACACCC AGAACGTTGT GTTGGATATT
GACAAACAAG AAGGTGTCGA CAGCACAACA GTCACGCTGA AGCCCAGCAA GTCACAGAAG
GTGACACTCT GCCACGAATG GATTACTGCG GATGCAGACA AGGACGTCCC TATGACTGTT
CGTAGCGATA ACAGTGCGGA GACGGTCAGT GTCAGTATCA TCGGATCCGA GCCGGTGAAA
GAAGACGATG ATGAGGAAAC AGAACCTGAT GTGTTAGACG ATGATGAAAC AGCTGACGGG
ACACCAGGGT TTGGTGTTGT AGGCACCCTC ATTGTGGTTC TCATGGCGGT AGCACTTGCT
CATCGACGCC GCTGA
 
Protein sequence
MVSRGIRVGL GFPDQPGRPS QHRRVNLAPV TSNKFSFTAS RIPVLRDIRG SSSMSHRSVF 
LAACIGILVV CCAIALVPAG VSATDDDTPD PDEYDSLVDG MDGNGTADDP FHVTNVTELQ
AMEANLSAHY KLVSPIDASE TADWNDGDGF DPIGACDFNA TIDECEETPF EGSFDGGLYP
IADLTIDREN ESEVGLFGYV FPDGEVVDIK LRDARVTGNV EVGTVTGLNF GSIEGADVTG
TTSGDLDAFP GRIGGLAGYN GGEVRDSFVD NDVIGPSFTG GAVGSNNGTI VQTHASGDVE
ALDISGGLVG TNVGDGEVGH STASGDVNGT FSGFGGFVGT HVAGEGIIYE SYATGDVSGE
TISAAGGFAG SNSALTGEAV IYDAYATGNV NGEDRLGGFV GDLGSSAFVE MSYATGSVSG
APDEASVGGF AGNVPDRDAV AITGSYYDEV TSQQDEGIGV GDGDVTGLPT QNMTDDAAAE
NMTAFDFAAI WTTLPDDYPT LQALDPEPMP PDPPNFAVTI DETTSPVTEG SPLNVNTTIE
NVGEQATEQT VSLEVAGDQR DVTTVALGGD EYETVVLTWE TEVNAAGDYD ATVSSDDDSE
TVPITVEEQP DDAVFDVTID ETNSPVTEGG NLLVNATVEN TGNLADEQDV NLTINGDEVD
ATVVSLDGDE TEEITFTWET EENDADDYDA TVSSDDNSET VPITVEEQPD DAFFDVTIDE
TSSPVTEGEE LEVGATIENT GDLADEQDIK LSINDSEMDV TTVDLDGNET EEVTLIWETE
KSEAGDYVAN ISSNNDLDMA NVSVGEKPDP PTPTPPTPDP AFFDVTVDNT TSPVTEGEKL
LVNATIENTG DRSDKQDINL TINGNEVNVT SIELDSDESE KVTLTWETEK SNTGEYVVTV
STKDNTDMEN VTVNAIKPAF FTVDIKEVTD SVHINEEVCE KAYITNVGEE VDTQNVVLDI
DKQEGVDSTT VTLKPSKSQK VTLCHEWITA DADKDVPMTV RSDNSAETVS VSIIGSEPVK
EDDDEETEPD VLDDDETADG TPGFGVVGTL IVVLMAVALA HRRR