Gene Nmag_3933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3933 
Symbol 
ID8826803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp336385 
End bp337875 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content67% 
IMG OID 
Productpeptidase M20 
Protein accessionYP_003482036 
Protein GI289583626 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGACT CGAGTACGCC CGCCGGCGCG TCCGATCCCA CGGACGCCCT CGAGGAACAC 
GACGAGACGT TCCGCACCGA TCTCGAGTCG TTGCTCGCCC AGCCCTCGAT CAGCGCGACC
GACGAGGGCG TTGTCGAGTG CGCCTCGATG GTGCAGGAAC TGTGTCTCGA GTACGGCTTC
GACGAGGCGG AGATCGTCGA GACGCCGGGA CAGCCGGCGG TTATCGCGCA TGCGGCGGCC
GACCGTGGCG GAGAGGCGCG AGAAAAAGCG GCTGACGATG ACGAGACCAG TCAGGAGACA
CCGACAATCC ACCTCTACGG CCACTACGAC GTCCAGCCCG CGACACCCGA GGAGTGGGAC
TCACCGCCGT TCGAGCCAAC CGTCCGCGAG GGGCCCGACG GCGAGCAGCG CCTCTACGCC
CGCGGCGCGG GCGACAACAA GGGCCAGTGG TTCGCCCACG TCTGTGCCGT CCGCGCACTG
CGCGAAACCA CCGGCCTCCC CGCGAACGTC ACCCTCCTGA TCGAGGGCGA GGAGGAAAGC
GGCAGCGAGC ATCTCGAGTG GCTCGTCCGC GAGCACCGGG ACGACCTCGC CTGCGACGTT
GCCGTCGTCG CGGACGGGCC GATCGATTCG TCGGGACGGC CCCACGTCCT GCTCGGCGCG
CGCGGACTCC TGTACGTCGA CCTCGAACTG CGCGGGGCGA ATCAGGACCT GCACTCGGGC
AACTTCGGCG GCCCGGTGCC GAACCCGGCC GCCGCACTCA CCGATCTACT CGCCTCACTC
GAGGACGACG GGCACGTGAC GCTCGATGGA TTCGACGACG ACGTGCGGCC GCTAACCGAC
CGCGGTCGAG AAATCGTCGC GGAGATTCCG GTCGACGAGG ACGAGATTCG AGACGAACTC
GCGCTCGACG CGTTCGAAAC CGATGCGGAC GAGAACTACG TCGAGCGCCT GCTTACGCGT
CCGAACCTCA ACGTCGCGGG GCTCGACGCG GGCTACCACG GCGACGGGAT GAAGACGGTG
CTCCCCTCGG AAGCGAGCGC GAATATCGAC TTCCGACTGG TTGCCGACCA GGATCCGGAC
GCGATCTACG AGTCGCTCGT CGACTACGCG ACGGCACACG TGCCGGCTGG CATCGAGGTC
GAACTCTCCC GCGTCGCCGC GATGGCACCG CAGCGGACGC CAGCCGACAG TCCCGTGGTC
GAGCCGGCGA TGCGGGCGAC GCGCGAAGGA TGGGGCACCG AGCCGATTCT GAAGCCGACA
CTCGGTGGGT CTGTTCCGAC GTACGTCTTC GCGGACAACT TGGACGTGCC GTGTCTCGTG
ATCCCCTACG CGAACGAGGA CGAGCGTAAC CACGCGCCGA ACGAGAACCT CAAACTCTCG
TGCTTCCGCG CAGGGGCACG GACTACAGTA GCACTCCTTT CGGAGTTTGC CGAGGCAGAT
CTTTCGGGTT CCTCGGCCTC GGCCTCGTCC TCGACCTCAA CTTCGACTTA G
 
Protein sequence
MTDSSTPAGA SDPTDALEEH DETFRTDLES LLAQPSISAT DEGVVECASM VQELCLEYGF 
DEAEIVETPG QPAVIAHAAA DRGGEAREKA ADDDETSQET PTIHLYGHYD VQPATPEEWD
SPPFEPTVRE GPDGEQRLYA RGAGDNKGQW FAHVCAVRAL RETTGLPANV TLLIEGEEES
GSEHLEWLVR EHRDDLACDV AVVADGPIDS SGRPHVLLGA RGLLYVDLEL RGANQDLHSG
NFGGPVPNPA AALTDLLASL EDDGHVTLDG FDDDVRPLTD RGREIVAEIP VDEDEIRDEL
ALDAFETDAD ENYVERLLTR PNLNVAGLDA GYHGDGMKTV LPSEASANID FRLVADQDPD
AIYESLVDYA TAHVPAGIEV ELSRVAAMAP QRTPADSPVV EPAMRATREG WGTEPILKPT
LGGSVPTYVF ADNLDVPCLV IPYANEDERN HAPNENLKLS CFRAGARTTV ALLSEFAEAD
LSGSSASASS STSTST