Gene Nmag_0080 details

Gene Information

Locus tag: Nmag_0080
Symbol:
ID: 8822899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 96366
End bp: 97670
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: peptidase M20
Protein accession: YP_003478241
Protein GI: 289579775
COG category:
COG ID:
TIGRFAM ID:
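
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 96366
END_BP = 97670


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming a complete unspliced CDS with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1305 434
```

Both results match the Gene Length (1305 bp) and Protein Length (434 aa) fields.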


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.306901
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAGCC AATTCAAATC ATTCACCGAG AAACTCCTAT CATTCCGTAC AGAGTCCGGC 
AACGAACAAC CAGCACAGCG GTGGATTCGT AACCAGTTAG ATGCAGTTGG GTTCGAAACC
TATGAGTGGA CGGCAGATCC GGAGCTGCTT GCGAACCATC CATCGTTTCC ATCAGATCCG
GCCACAATAG AGACCGCAGA CCGGCCATCG GTGGCTGGCG TTGTTGAATT CGGGGATCCT
GATGAAGGCC AAACGATCGT TCTCAACGGC CACGTCGATG TCGTCCCTGC TGAGGAAGCA
CAGTGGGATA CTGACCCGTT CACGCCAACG TGGGACGGCG AGAAGCTGAT TGCCCGTGGC
GCTGCAGACA TGAAAGCCGG CCTGAGCGCC TGTCTCTTCG CTGCAAAAGA ACTTGCTGCA
CAAAACACAG ACAGCGACGA ACTGAATGGG CGTCTTGTCG TCGAGAGCGT CGTCGGCGAA
GAGGAGGGCG GAATCGGTGC AGCAATGGCC GCACTATCGA ACCCGTATCC GTTTGAGCGA
GATGCAGCAA TCGTTGCGGA ACCAACAGAG CTAGAGTTGG TCACAGCAGT CGAGGGCTCG
GTGATGCTCC GGCTGGAACT CGAGGGGAAA TCTGCTCACG CGGCGACACG GTGGCGTGGG
GAATCAGTAC TGCCGCACTT CGAGCGAATT CGAACAGCGC TTCGAGAACT GGAGACGGAG
CGCTCTCTCA CCGTTACACA TCCACTCTAC GAGCGGTTTG AGACACCGTG GCCGATCTCA
GTTGGAACAG TTCAGGCTGG TTCGTGGGCC TCCTCAGTTC CGGCGACGCT CACCGCTGAG
ATACGGGTCG GTGTCGCACC CGGGGAAACA GTCACAGAGG TAGAGTCGGC CGTCCGCGAC
CGGATTGACG CTGTCGTCGA CGGAGACGAC TGGCTCGAAG CACATCCCCC ATCACTCGAA
CGGTTTTCAG TCCAGTTCGA ACCGGCGTCT GTGTCCCACG ATGAGCCGAT CGTCCGTCAC
TTGCAAGCGG GAATGGAACA GAACGGCCTC GCAGATACCG CGCCAAAAGG CGCGACGTAC
GGCGCGGATT CGAGACACTA TCAGGCTGCA TCGATACCAA CCGTTCTCTT TGGGCCGGGG
TCGATCGACA ATGCACACTT CCCGAACGAG TCCATTCAGT GGGACGCTGT TGAACAGAGC
AAAGATGTCC TCGTAGACAC GCTTGCAGCT ATTCTGGGGG AGGACACACC GACCAACACG
GCCCGTACCA GCCACGAACC TGAAGGGTCA CGTTCGAAGA ATTAA
 
Protein sequence
MGSQFKSFTE KLLSFRTESG NEQPAQRWIR NQLDAVGFET YEWTADPELL ANHPSFPSDP 
ATIETADRPS VAGVVEFGDP DEGQTIVLNG HVDVVPAEEA QWDTDPFTPT WDGEKLIARG
AADMKAGLSA CLFAAKELAA QNTDSDELNG RLVVESVVGE EEGGIGAAMA ALSNPYPFER
DAAIVAEPTE LELVTAVEGS VMLRLELEGK SAHAATRWRG ESVLPHFERI RTALRELETE
RSLTVTHPLY ERFETPWPIS VGTVQAGSWA SSVPATLTAE IRVGVAPGET VTEVESAVRD
RIDAVVDGDD WLEAHPPSLE RFSVQFEPAS VSHDEPIVRH LQAGMEQNGL ADTAPKGATY
GADSRHYQAA SIPTVLFGPG SIDNAHFPNE SIQWDAVEQS KDVLVDTLAA ILGEDTPTNT
ARTSHEPEGS RSKN