Gene Nmag_3201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3201 
Symbol 
ID8826064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp3317694 
End bp3319610 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content63% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003481313 
Protein GI289582847 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTCGA TGTCCGGAGA CGCTGACACC GGCTACAGCC GTCGCTCGCT CCTCGCTGCC 
GGCGCGACTG GCCTCTCGAT TGCTGCCAGC GGTTGTATCG ACCGTGTCCA GAGCGTCGTC
GACCCTGATG CCTCCGAACA GCTCTCACTG TCTATCCTCT CGCTCCCCGC TGCCGACGGT
GACAGAGAGA CCGCCGAAAT CGCACGTCAT CTGGAGTCCA ACCTCGAAGC GGTCGGCATC
AATGCGACGA TCAGCACTCG CTCCGAGTCC GAGTTCCTGG AGGCGATTCT GATCGACCAC
GACTTCGACC TCTACGTGGG CCGCCACCCG GCTGACTTCG ACCCGGACTT TCTGTACGAG
ACGCTGCACT CCACCTACGA GTACGACCGA GGCTGGCAGA ACCCGTTTGG CTTCACCGAC
CCGCTGTCGA TCGATCCGTT GCTCGAGCGC CAGCGCAACG AAACTAGCGC CAATCGCGAG
CGGACTGTCA CGGCGCTCCT CGAGGCAATT GCACAGGTGA AACCGTTCGA ACCGATCTGC
GTGTCGAACG AGTACCGCGC CGTCCGAACT GATCGGTTCG ACGGCTGGGA CGAGACCCAC
CTCGGAACCG GACGGGGCTA CCTCGGACTC GAACCTGACG CTGCGTCCGA CGCCGAACAA
CTTCGTGCGC TCGTCACGGA CCAGCGGCTG ACCCAGACCT TCAACCCGCT CTCCCCGACG
ACGCGCGAGC AGGGGGTCAT CGTCGACCTG CTGTACGACT CGCTTGCCGT TCCAACGTGG
TCCGTTCCCG ACGAGGCTGA GGCAACGGAC GAGGAGACGC GAGCGGGTGC AGAAGTGATC
GACGCGGCAA ACGTCTCGAT CGAACCCTGG CTCGCGACCG ACTGGGAGTG GGACGGTGAG
GACCGAACCA TGACCGTCGA TATTCGCGAG GACTGTCTGT TCCACGATCA CAGCGAGGAC
GGCGACGACA CTGACGCAGA CACCGGCGAT AGCAACGGTA ACGATAGCGC CATCCCCCTC
ACCGCCGAGG ACGTCAGGTT CACCTATCGA TTCCTCGCAG ACACGACGTA CGGACGCACC
TCGAGTGCTT CCCCCGCACC ACGCTATAAG GGCCATGCAA GCGCAGTCGA CTCGGTCACC
GTCGAAGACG ACTACCGAGT CACAATTTCG TTCTCGACGG ATCGGGCCGT CGCCGAGCGT
GCGTTGACGG TCCCGATCCT CCCATCCGGC TCTGACTCAC CCTGGATCAC CCATCTCGAG
AACAGCGTCG AGTCCACAGA CGACGACTGG TCGCCGACGC AGGGTGACTG GAGCATCGTG
ACGACCGGTC ACGCGCCACC GACAGGTAGC GGTCCCTACA AGTATGACTC CCACGACGCG
GGAGAGTCCG TCACGCTCAC GCGCTTTGAG GATCACTTTA CGCTCCGGGA CGACGATCAC
GACCTGCCAG CGCCTACCGC CGACGAACTT CACTTCGAGG TCGACCCGGG AACGAGTTCG
TCGATCGAAC GGGTCGCAAG CGGCGACGCC GATATCACGA CATCGGTACT CGATACCTAC
GCACTCGAGT CGACGGTCGA CGACGACGCT GTGTCGTTCG TCGAGTCGCC GTCGTGGTCG
TTCTATCACG TCGGCTTCAA CACGCGCAAC ACGCCGTGTG GCAGCCCCAA CTTCCGTCGT
GCGGTCTCGC AACTGATCGA CAAGCAGTGG ATCGTCGACG AGGTGTTCGC TGCCGAGGAC
AGTGCACGGC CCGTCGCGAC ACCGGTTGCC GAAGAGTGGA CGCCCGACTC GCTCGCCTGG
GACGATGGCG AGTTGGTGAC GACGTTCGTC GGCGAGGACG GGGAACTCGA CGTCGACGCC
GCGCGAGAAC TGTTTGCATC ACACTTCGAG TACGACGACG GAGCGCTGGT CCGGTAG
 
Protein sequence
MNSMSGDADT GYSRRSLLAA GATGLSIAAS GCIDRVQSVV DPDASEQLSL SILSLPAADG 
DRETAEIARH LESNLEAVGI NATISTRSES EFLEAILIDH DFDLYVGRHP ADFDPDFLYE
TLHSTYEYDR GWQNPFGFTD PLSIDPLLER QRNETSANRE RTVTALLEAI AQVKPFEPIC
VSNEYRAVRT DRFDGWDETH LGTGRGYLGL EPDAASDAEQ LRALVTDQRL TQTFNPLSPT
TREQGVIVDL LYDSLAVPTW SVPDEAEATD EETRAGAEVI DAANVSIEPW LATDWEWDGE
DRTMTVDIRE DCLFHDHSED GDDTDADTGD SNGNDSAIPL TAEDVRFTYR FLADTTYGRT
SSASPAPRYK GHASAVDSVT VEDDYRVTIS FSTDRAVAER ALTVPILPSG SDSPWITHLE
NSVESTDDDW SPTQGDWSIV TTGHAPPTGS GPYKYDSHDA GESVTLTRFE DHFTLRDDDH
DLPAPTADEL HFEVDPGTSS SIERVASGDA DITTSVLDTY ALESTVDDDA VSFVESPSWS
FYHVGFNTRN TPCGSPNFRR AVSQLIDKQW IVDEVFAAED SARPVATPVA EEWTPDSLAW
DDGELVTTFV GEDGELDVDA ARELFASHFE YDDGALVR