Gene Nmag_1503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1503 
Symbol 
ID8824337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1531473 
End bp1532681 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content65% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003479641 
Protein GI289581175 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.396474 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCCTGC GATGTTCGCT GCTCGGGCAC GACTACGGGG AATCCGAGGT CGACCGCGAG 
CGCGAAGAAC GGGGCAGTGA GGTCGTCGTC ACCGTCCAGG AGTACGAAGA GTGTGTCCGC
TGTGGTGACA GACACGTCAT CAGTGAGAAC ACTGAGGTAA CGAGTCTCTC GGCCGCACCG
GCGACTGAGT CGGACGCGGT TGCTGACGCA GCTGCCACAG CTGACACCGC TGAGACGACT
GCGACGCAAG ACGCCGACCT GCCCCACGAC GACGTATCGA CAGCCACGTC CACGCCGACG
TCCCCCACGG ACTCGACTGC GGCCGAGCAA GGCACAACAG AAGAACCCGC TGACGCGACG
ACAGTCGAGG CGGACGACGC AATTACCGAC GACGCGACGA TCATCGACGG TGATGGCGGT
GACGATGGTG ATGGTGTCAG TGACGGTGAT GGTGACGGTG AAACTGAGCC TACGCGTGCA
CACAATGACG TCGAGTCACA CGCCGCAGAT ACCGCCGCCA CTGACGCCGA GACCGCACCC
GCCTCTGACG GTGCGGACGA ACTCAACGTC CCGACGGACG AGGACGGCAA CCCAGTCACC
GACGACGGCG AGATACTCGA GGACGAGCCA GCACACGCCA GCGCGGGTGG CGACAGCGAC
CGCGAGCACG GCGAGTGGCC AGACTCCGAC GACGTCGGCC CGCCAGTCGA CGAGGAAACA
GATACCGAAC ACGAAGAGTG GCCCGACACG GGCGAGCAGG TCGATGACGA TGCAGTCGTT
CTCGAACACG ACTCGACGGC CTACGAAGAT GACTCGGCAG CCGATGCCAG AACAGCGACC
GTCGACGCCA CTGACCACGA ACAGTTCGGT GGATCGCAGG ATGCCGGGCA GCGCGATGAG
CCAGTGAGCG AATCGAACGC GACCGGGTTT GGGGCGACGG CCGGAACAGG TACCGATACT
GGTGGCGGTG ACACGACCGC CGCTGCAGCT GAAGCGGAGG CCGAGATGGC CGAAACAGGG
AGCGGCATCG AACGCGTCGG AGACGCCCCC GCCCCCGGCG ACGCTACCCA TCCAGACGAC
GACGTGCCGA GCGAGTTCTA CTGTCCACGC TGTGAGTTCG TCGTGAGCAG CGACCGCGGC
TCGCTTCGGG CAGGCGATAT CTGTCCGGAG TGTCGGAAGG GATACCTCGG CGAACGGGCC
AGACAGTAA
 
Protein sequence
MVLRCSLLGH DYGESEVDRE REERGSEVVV TVQEYEECVR CGDRHVISEN TEVTSLSAAP 
ATESDAVADA AATADTAETT ATQDADLPHD DVSTATSTPT SPTDSTAAEQ GTTEEPADAT
TVEADDAITD DATIIDGDGG DDGDGVSDGD GDGETEPTRA HNDVESHAAD TAATDAETAP
ASDGADELNV PTDEDGNPVT DDGEILEDEP AHASAGGDSD REHGEWPDSD DVGPPVDEET
DTEHEEWPDT GEQVDDDAVV LEHDSTAYED DSAADARTAT VDATDHEQFG GSQDAGQRDE
PVSESNATGF GATAGTGTDT GGGDTTAAAA EAEAEMAETG SGIERVGDAP APGDATHPDD
DVPSEFYCPR CEFVVSSDRG SLRAGDICPE CRKGYLGERA RQ