Gene Nmag_2372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2372 
Symbol 
ID8825224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2418413 
End bp2419603 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content57% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003480496 
Protein GI289582030 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAT TCACACGACC GGTATCCAGG AGAGGGGTTT TAACGAGTGG TGCAACTGCA 
GTCGGAGTGG CGGTTGCTGG TTGTATAAGT GACGATACAG ACGGAGCCGG CCAGAGCGGC
GAGTCTGGGG GTGGATCGGG CAATGGATCC AGCGAAGAAA CGTACGAGGT TGGATACGGA
GACTATCGAA CGACAGTAAA CGCGTCGGCG TTTCCGGACG AACTACGAAT TTACGCGGTC
CAAACCGGTT GGTCGAATTG GGACGCCGTA ATGGAGAACT TCGAAAGCGA GTACGGTGTT
CCCCTCTACG ACGCACAGGG ATCGTCTGGC GAGGCACTCA CCGACGCACG GTCAAACGCC
GGTAATCAGA CACATTCAGC GTTTAACGGC GGCTACTCGT TCGCTCTCGA GGCGATGAAC
GATGGCCTGA CGACGGATTA TAAGCCCGCC AACTGGGACG TGGTCCCTGA CGAACTCAAA
ACCGACAATG GTCACGTCGT TTCGACTCGA CAGATGACGA CAGCGGTCAC CTACCGTGTT
GACATTTATG AGGAACGCGG TCTCGACGCA CCCGAGACCT GGGAAGACCT CAAACACCCA
GACATCGCAC AAGATCTGGC CTTCACGCCA CCTCATACAG CTAATGGACT TGCGTCGGCA
CTGTCGGTCA ATAGAGCCTA CGGCGGTTCG ATGGCGAATC TAGATCCTGT TATCGAGTAT
CACGAGGAAA TCGCCGACCA CGGCGCAGAC ATTCGTCGAA ACATCGAGGG AGACGTTACC
AGCGGCGAGA TATCGACCGT CATTGAGTAC GATTACTCGG GACTGAACAT GAAGTACAAC
ATGGATGAGA TCGACGAGGA ACAACTCGAG GTCGCAATAT TGACCGGTCC GAGCGGCAGG
GAGGGGGCGA TGAACGTTCC GTACGGGTTT GGACTGCTCG AGGGGGCACC AAATCCCGAG
GCGGCGAAGT TGTTCATGGA CTACGTGCTC TCGCTAGAGG GTCAGGAGCT GTTCTTCGAC
GCGTTCGTCC GCCCGATTCG GGCCGACGAA CTCGAGCAAC CCGAGGAATT CCCCGATCAG
TCCGACTACG ACGCAGCCGA GTTCGCCCTC GATCAGGAGG AACTGGTCGC AAACCAGGAG
TCGATCCAGC AGGAACTCAC CGAACGAACC CCGCTACCGG GCGCACAGTA G
 
Protein sequence
MAKFTRPVSR RGVLTSGATA VGVAVAGCIS DDTDGAGQSG ESGGGSGNGS SEETYEVGYG 
DYRTTVNASA FPDELRIYAV QTGWSNWDAV MENFESEYGV PLYDAQGSSG EALTDARSNA
GNQTHSAFNG GYSFALEAMN DGLTTDYKPA NWDVVPDELK TDNGHVVSTR QMTTAVTYRV
DIYEERGLDA PETWEDLKHP DIAQDLAFTP PHTANGLASA LSVNRAYGGS MANLDPVIEY
HEEIADHGAD IRRNIEGDVT SGEISTVIEY DYSGLNMKYN MDEIDEEQLE VAILTGPSGR
EGAMNVPYGF GLLEGAPNPE AAKLFMDYVL SLEGQELFFD AFVRPIRADE LEQPEEFPDQ
SDYDAAEFAL DQEELVANQE SIQQELTERT PLPGAQ