Gene Nmag_2234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2234 
Symbol 
ID8825084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2290533 
End bp2291792 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content64% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003480361 
Protein GI289581895 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGACG ATACCATGAA CGTCGACCGT CGCGGTGTGC TCCGTGGCGT CGGCGGCGGG 
GCGATCGTAC TGGCGGGACT GGGCGGCGCT GGCAGTGTCA GCGCACAGGA TGCAATCACA
GTCACGGCGG TCTGGACCGA CGACGAGGAA GAAGACTTCC TCGCTGTGGT CGACTACGTC
GAGGACGAGA CCGACATCGA CATCTCGTAT GCGCCACGAG ACACCGAAAC GCTCCTGACA
GAGACGCTGA TGGACTACGA GGCCGGCATC GCGACGGCGG ATATCGTCGT GTTGCCGACC
GAGGGACGTG TCCGACGGGA CGGCGAAGCG GGGCACCTCG AACCGGTTGG TGACCTGTGG
GACGAAGACG AGTACTCGAC CGAGCACGCG GTGGTCGAGG CGAACGGCGA GGTCTACGCC
GCTCCGTTCG GGATGGACCT CAAGCCGGGC TTCTGGTACC GCCAGTCGTT CTTCGATGAA
CACGGACTCG AGGAGCCCGA GGATTACGAC GCCTTCCTCG ACCTCCTCGA CGAGATCGAC
GGCATCGAAG GCGTCGAGGC ACCGCTCGCG TCCGGGAACG GCGACGGCTG GCCGCTCAGC
GACGTCACGG AGGCGTTTAT CCTTCGGCAG GACGACGGCG CACAGCTCCA GCAGGACCTC
ATCGAGGGCG ATGCCGAGTT CACCGACGAC CGCGTCGTCA CGGCCTTCGA GGAACTACAG
GAACTCTTGC AGGCGGGCTA CTTCAGCGAG GTCCGTGATT TCGGTGTGCA GTACGAGTTC
TTCTGGGAGA ACGAGACGCC CCTGTACTTC ATGGGGTCGT GGACACCAGC CTTCGGCGCA
ATCGAGGATC CAGACGACCT CGAGTACTTC ATGCTCCCGG GTACCGATGC GATGGTGACC
AGCATCAACT GGTTCACCAT CCCCGCGTAC ACGGAGGCGA CTGACGCGGC CAGAACCGCC
GTCGAGGAAA TCATCTCTCC CGACGGTCAG GAAGTCTGGA CCGAACGCGG CGGTTTCGTT
CCGTCATCGC TCGAGGTGCC GGCAGACGCG TTCGACCACG ACATCATGCA GGAACTGTCC
GAACACGCTG ACGAGGTCGA ACTCGTCCCC GACCTCGACG ACGCGGTCGG CGATCCGTTC
CAGGCCGAGT TCTGGTCGCA ACTGCTCGGT CTCTGGGCCG AACCAGACCA GGACGTCACC
GGCATCACCG AGTCGCTCGA CGGCGTCTTG CAAGAAACCG TTCAGGAGGA CGACCCATAG
 
Protein sequence
MGDDTMNVDR RGVLRGVGGG AIVLAGLGGA GSVSAQDAIT VTAVWTDDEE EDFLAVVDYV 
EDETDIDISY APRDTETLLT ETLMDYEAGI ATADIVVLPT EGRVRRDGEA GHLEPVGDLW
DEDEYSTEHA VVEANGEVYA APFGMDLKPG FWYRQSFFDE HGLEEPEDYD AFLDLLDEID
GIEGVEAPLA SGNGDGWPLS DVTEAFILRQ DDGAQLQQDL IEGDAEFTDD RVVTAFEELQ
ELLQAGYFSE VRDFGVQYEF FWENETPLYF MGSWTPAFGA IEDPDDLEYF MLPGTDAMVT
SINWFTIPAY TEATDAARTA VEEIISPDGQ EVWTERGGFV PSSLEVPADA FDHDIMQELS
EHADEVELVP DLDDAVGDPF QAEFWSQLLG LWAEPDQDVT GITESLDGVL QETVQEDDP