Gene Nmag_1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1789 
Symbol 
ID8824629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1821828 
End bp1822931 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content61% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003479925 
Protein GI289581459 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.463605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCAA AAAACACGCA CGGGAGACGG ACGTTCCTTC GATCGACGGC GGCAGCGGGG 
AGTGTCGCCG CCCTCGGGGG GCTCGCAGGC TGTACAGGAA TGCTCGACGG TGGGGACGAC
ACGCTGACCG TTGCGGTCTA CGGCGGTGTG TTCCAGGACG TTATGGACGA GGACCTCTTT
GCTCCGTTCG AAGAGGAAAC CGACATCAGC GTCGAGTCAG AGGCACAACC AACATCCGAG
GAAGCGCTCA CGCAGTACGA GAACGCCGTT GGTGCCGGCG ATGCGCCAGT CGACGTCGCG
ATCATGGCAC AGACCGGTGT CCTTCAGGGA CTAAACTCCG ATCTGTGGCA CATCTGGGAC
GACGACGAGT TCGAGAATCT CGAGTACATC AGCGACGATC TCGTCGGTGA GGCCGACGGC
GGCATCTCGA GTATCGGTGC GCTGTCGTGG TACATCAACC TCGTCCAGAA TACGGACGTC
ATCGAGGAGC CAATCGATTC CTGGGAGGCG CTCTGGGACG ACGAGTACGA AGATACGCTC
GGCCTGCTCG GCTACGCGTC GAACTCGTTC CTGCTCGAGG TCACCGCAGA AGTGCACTTC
GACGGCCAGG ACATTCTCGA CGACCGCGAC GGCGTCGAGG AAGTGTTCGA GGAACTCGAG
GGCGTCACGG ATCAGGCGAA CTTCTGGTAC GAGAACGAAG CGGAGTTCCA GCAGCGTCTC
CGAGACGGCG AAGTGCCGGC CGGCATGCTC TACAGTGACA TTACGCGAGT CATGCAGGAC
GACGGCGCGC CAGTTCAGTC GAACTTCGTC CAGGAAGGAT CGATTCTCGA CTCCGGGCTC
TGGGTCACAC TCGAGACGTC CGACCTCAAG GAGGAGGCGC GCGAGTTCAT CGACTACGCG
AGTCAGCCCT CGGTGCAGGA CGAACTCGCA CAGGGACTGT ACACGAGTCC GACGGTCGAA
CGTGAGTACT CCGAGATCGA CGACGACTTC TACGAGGAGG TCGCCGGACC AGGACCGGAC
GAAGCGATCA CGCCCAAGTA CGAACTCTAC GTCGAGGAAG AGGACTGGGT TAGTGAGCGC
TGGGAACAGT TCATCATCGG CTAA
 
Protein sequence
MPAKNTHGRR TFLRSTAAAG SVAALGGLAG CTGMLDGGDD TLTVAVYGGV FQDVMDEDLF 
APFEEETDIS VESEAQPTSE EALTQYENAV GAGDAPVDVA IMAQTGVLQG LNSDLWHIWD
DDEFENLEYI SDDLVGEADG GISSIGALSW YINLVQNTDV IEEPIDSWEA LWDDEYEDTL
GLLGYASNSF LLEVTAEVHF DGQDILDDRD GVEEVFEELE GVTDQANFWY ENEAEFQQRL
RDGEVPAGML YSDITRVMQD DGAPVQSNFV QEGSILDSGL WVTLETSDLK EEAREFIDYA
SQPSVQDELA QGLYTSPTVE REYSEIDDDF YEEVAGPGPD EAITPKYELY VEEEDWVSER
WEQFIIG