Gene Nmag_1438 details

Gene Information

Locus tag: Nmag_1438
Symbol:
ID: 8824271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 1469484
End bp: 1471148
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: extracellular solute-binding protein family 5
Protein accession: YP_003479579
Protein GI: 289581113
COG category:
COG ID:
TIGRFAM ID:
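
The coordinates and lengths listed above are consistent with one another, which a short check makes explicit. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (a common convention for such records, but not stated on this page) and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1469484, 1471148)
print(length)                  # 1665
print(protein_length(length))  # 554
```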


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.3177
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAGACG ATAGCAAATC GCGGCGTACC GTGCTGAAGG GTGTTGGGAT TGCAAGCGCA 
GCGGGGCTAA CGACTAGTCT CGCAGGATGT GTCAGTCAGA ACGGTGGCTC AGATGTCGAA
GGTGCGGAAA ATCTCGGTGA CGAAGTGCCC GAGATTCATC TCGTTGCACC GACGGCAGGT
GCGAACCCGT TCCGTAACGA GCTCTCGGAT ATCGTTGCCG ACAACTGGGA AGAACTCGGC
TTCGAGGTCG ACCGCGAAGA ACTCGATTTC AACGCTCACG TCGATCAGGT GGTCGTCGAA
CAGAACTTCG ATGCGTCGCT ACTCGGCTGG GGTGGCACGC CGGAGCGGAT CGACCCGCAC
ACGTTCATTT TCGACATGCA CCACTCCTCG ACGACGGGAG AGGGCGGTCG AAACACGCCG
GGCTGGGAGA GCGACGAGTA CGACGAACTG GCGGAGCTGC AGGTAGCGCA GGTCGACGAA
GACGAGCGCC AGCAGACCGT CTACGAGGCA CAGGAGATGA TCGCGGAAGC GCAGCCGCGA
ACGTACATCG CAAACGAAGG CGGCTACGAG CCATACGCGA GCGCCCGCGT TACCGACATC
AACCCGACGC TCGGTGAAGG GCTGAACTCG TTCTGGAATC TGACGTCGGT GACGCCGACG
GACGACGACA CGGTCCGCTT TGGTTACCCA TCCGAGATTA TCTCGCTGAA CCCAATGCAG
GACCTGGCGA CGCCCGACCG GCAGTTCGTC CGCCTCATCT ACGATCAGCT CTACCGGATC
GGTGAGGACG GAATGCCGAC ACCGTGGCTG GCGGCGGACG ACCCGGTCAT CGAAGACGAC
GGGATGACCT ACACCGTCGA GATTCGGGAC GGCCACACCT TCCACGACGG CGAGTCAGTG
ACGATCGACG ACGTGGAGTT CACGTACGAA CTCTACGCCG ACTCGCCGAC GTACAGCTCG
CTCGTCGAGG ACATCGACGA GATCGACACC TCGGGGAACG AGATCACGTT CCACCTCGAA
GAGCAGTACT CGCCGTTCGT GGCAAACGTC CTCGGACAGG TGTACATCTT CCCCGAACAC
GTCTGGGGCG ACGTCGATCC GGAGGAACTC GTCGACTACG AAGACGAGGA TTGGATCGGC
AGCGGCCCGT TCGAGTTCGT CGACTGGGAG CGCCAGGCCG AACTGCAGCT GTCGGCGTTC
GACGACCACT TCGAGGCCCC GAACGCGGAC AACCTCATCC GTGTCCCCGG TTCCGACACC
GCACAGCTCG TCAACGACCT CGAGGCCGGC CAGCTCGATA TGGTCGGTGC GGTGCCACAG
CCGACGGCTG TCGACCGCGT CAGAGAGGAC GACGATCTCG ACCTCGCCGA GTTCGAGGCG
ATCGGATACG CGATGATCGA GTACAACATG CGTCGCGAAC CGTTCGACGA CCGTCACGTC
CGCCGGGCAC TCTCCTACGG TGTTCCGAAG GAGGAGTACG TCGAGTTCAT CCGTGACGGG
ATGGGAACGG TGACGCACTC GACCATCTCC GAGCACAACG AGTTCTGGCA CAACCCCGAC
GTCGAGCAGT TCAACGAGGA CTTAGAGGCT GCACGTCAGG AGCTCGCAGA CGGTGGCTAC
GGCTGGGACG ACGACGGACG TCTCCACTAC GGCGAAGACC AGTAA
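
The GC content field can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as it appears here, in whitespace-separated blocks of ten bases. `clean_sequence` and `gc_percent` are hypothetical helpers, and the string passed in is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(seq_text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only; paste the full sequence to check the
# reported whole-gene value.
first_line = "ATGGGAGACG ATAGCAAATC GCGGCGTACC GTGCTGAAGG GTGTTGGGAT TGCAAGCGCA"
print(round(gc_percent(first_line), 1))
```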
 
Protein sequence
MGDDSKSRRT VLKGVGIASA AGLTTSLAGC VSQNGGSDVE GAENLGDEVP EIHLVAPTAG 
ANPFRNELSD IVADNWEELG FEVDREELDF NAHVDQVVVE QNFDASLLGW GGTPERIDPH
TFIFDMHHSS TTGEGGRNTP GWESDEYDEL AELQVAQVDE DERQQTVYEA QEMIAEAQPR
TYIANEGGYE PYASARVTDI NPTLGEGLNS FWNLTSVTPT DDDTVRFGYP SEIISLNPMQ
DLATPDRQFV RLIYDQLYRI GEDGMPTPWL AADDPVIEDD GMTYTVEIRD GHTFHDGESV
TIDDVEFTYE LYADSPTYSS LVEDIDEIDT SGNEITFHLE EQYSPFVANV LGQVYIFPEH
VWGDVDPEEL VDYEDEDWIG SGPFEFVDWE RQAELQLSAF DDHFEAPNAD NLIRVPGSDT
AQLVNDLEAG QLDMVGAVPQ PTAVDRVRED DDLDLAEFEA IGYAMIEYNM RREPFDDRHV
RRALSYGVPK EEYVEFIRDG MGTVTHSTIS EHNEFWHNPD VEQFNEDLEA ARQELADGGY
GWDDDGRLHY GEDQ
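
The protein sequence can be checked against the gene sequence by translating the latter. This is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), so a standard codon table is used. `translate` is a hypothetical helper that stops at the first stop codon.

```python
from itertools import product

# Standard codon table in TCAG order; amino-acid assignments are the same
# in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGGAGACG ATAGCAAATC GCGGCGTACC"))  # MGDDSKSRRT
```

Passing the full gene sequence in place of the 30-base fragment allows a direct comparison with the listed protein sequence once its spaces and line breaks are removed.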