Gene Nmag_0096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_0096 
Symbol 
ID8822915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp117996 
End bp119192 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content60% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003478257 
Protein GI289579791 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.265254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAATC GACGCGACAT CATCAAAGGT GCGGGAGCAG CTAGCATAGC GGGTCTGGCC 
GGGTGTCTCG GTGGAGACAA CGGCGGCAGG GACATCAGAC CGGTCGAGAT CGATTTCGAC
GACTGGCCGC CAGAGGAGTA CGGCGGCAAT CTTAACGCCT GGAACTGGTA CGTCGAGTGG
AACGAGTGGG GAGCCGAGGA TTTCGCCGAG GAGTACGACC TCGACAGCTA CTCCACAGAG
GCGTACTCGA CGCCAACCGA CTGGTTCAGC AATCTACAAG CCAGCCCAGA GAATCACGGG
ATCGATCACA TCGGTGCCTT CACGGAGTGG GTTCACCGCG CCCGTGAGGA AGAGATGATC
GAACCGATAC CGATCGACGA GTTACCCAAC GTCGAGGTCG CTGATCAGTA CCTCGATCCA
CACCGTGAAC TGTTCTGGAG CGACGACGGC GTCGGTGGCG TCTACGGACT ACCCCACTCA
GTTGTGATCA GCCCGATCGT GATGTACAAC ACCGAGGAGG TCGAGGACCC GCCCGAGTCG
CTCGACATCC TCTGGGATGA GGAGTACGCG GATGAGATTT CGATGATGGC ACACCACGGC
GGGTTCCTCT GCGATGTCGG AGCGCTGTAC ACCGGCCAGG ATCCGAACGA TCCGGACGAC
TTCGAAGAGA TCCAGGAGGT ACTCGAGCAA CAGCGCGACC TCGTCTTCAA CTACGCCGAC
GAACACGAGA CACAGATGCA ACTCGTGATG AGCGGTGACG CTGCCCTTGG AACGCACACA
GACGGGCGAG CGTTCAGGGC GATGTACAAC CAGGGCGGCG ACGTCGACTG GTTCATCCCG
GAAGAAGGCG CGACCTGGGG GACAGACGTG ATCCTGCTGC CACAAAACGC TCCGAACCCG
GTAACGGCGA CGATGTACAT CGACCACCTG TTCACGGATA CTGGCTGGGA GAAGTTTGTC
GAAACGACGG TGTACCGACC GCCGTTCGAA AACGAAGAGT TCACCGACGG GGAACTCGGC
GACGCCATTC GCGAAAAGTG GGACGATGAG TGGGACAAAC ACGGCGAGGC GGAAGATTTC
ATCGACGACC TGGTTATCAC CGACGAAGAG TTCGACCGGA TGCACCACAA CTGGCCCCGC
TCGGACGACG TCATCGAACG GTACGACGAG ATCTGGACCG AAGTCACCGC CGGATAG
 
Protein sequence
MVNRRDIIKG AGAASIAGLA GCLGGDNGGR DIRPVEIDFD DWPPEEYGGN LNAWNWYVEW 
NEWGAEDFAE EYDLDSYSTE AYSTPTDWFS NLQASPENHG IDHIGAFTEW VHRAREEEMI
EPIPIDELPN VEVADQYLDP HRELFWSDDG VGGVYGLPHS VVISPIVMYN TEEVEDPPES
LDILWDEEYA DEISMMAHHG GFLCDVGALY TGQDPNDPDD FEEIQEVLEQ QRDLVFNYAD
EHETQMQLVM SGDAALGTHT DGRAFRAMYN QGGDVDWFIP EEGATWGTDV ILLPQNAPNP
VTATMYIDHL FTDTGWEKFV ETTVYRPPFE NEEFTDGELG DAIREKWDDE WDKHGEAEDF
IDDLVITDEE FDRMHHNWPR SDDVIERYDE IWTEVTAG