Gene Nmag_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1940 
Symbol 
ID8824781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1974098 
End bp1975249 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content64% 
IMG OID 
ProductABC transporter, periplasmic binding protein, thiB subfamily 
Protein accessionYP_003480073 
Protein GI289581607 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCGAC GGACGTTCGT CCACGGCGTC GGCGGCGGCT CGGTCACCGC ACTCGCTGGC 
TGTCTGACCC GTAACGGCGA GAACGAAGAA CACGCCGCCG AAACGGGGCC GCTGCGAGTC
GCGACCTACA CTTCCTTCGC GACCGGCTCA GACTCCGACC CTGATACCGA CTCCGATCCC
AATCCCGATC CCGATCAAGC CCCCAGCCCC GCCGGTGACT GGTTCCGAGA AACCGTCGAA
GAAGAGTTCG AGGAGGAGAT CGAGTGGACC GTCCCCGAAT CCGGCATCGA ACACTACATC
CAGCGCGCCC GTCTCGACGC CGACATCGAC ACCGACGTCA TCCTCGGCCT CACCGCGAGC
GAACTCGCAC TCGTCGACTC CGTCCTCGAC GCCCACGGCG ACACGCGACT GTTCGAATCG
CTCGAGCGCG ACCGTCTCGA GCACGCCGAC CGAATCCAGT CCGACCTCGC CTTCGACGAC
CCGCGCGACC GCGTACTCCC CGTGGGCACC AGCTACCTCT CGCTCGTCTA CGACGAGACG
GTACTCGAGT CCCCACCCGA GACGTTCGAC GACCTCCTTG ATTCGGCGTA CGCGGACACG
TTGCTCGCGC AGGATCCGCG TGTGTCAAAT CCGGGGCAGG CGTTTTTCCT ATGGACGGTC
GCGGAGTACG GGTCCGGCTC TGGCATGCTC TCCTTCTGGG AGGAGTTGCA GGCGAACGGC
GTTCGCATCG AGGAACGCTG GACGGACGCC TACCGGGATG CCTATCTCGA AGGTGAGCGC
CCGATGGTGG TCTCGTACTC GACGGATCAG GTGGTTGCGG CCGCGACTGA TCGAGACATG
CAGCGCCACC AGGTCGCACC GCTTGACAAC GCGGGATATC GGAGTACTGA GGGGGCAGCG
ATCTTCGCGG ACGCGACGCG GACGGAACTC GCTTACGAGT TCGTCGACCT CCTGTTGTCC
CAGACGGCAC AGGCGGAGCT CGCGACGCGA AACGCGCAGT TCCCCGCCGT CAGTGACGAG
TACGTCGACC TCGATGCGAC GTTCCTCGAG AACGCGGTAG AGCCAGACGA GACAGTAACG
CTCACCTACG ACGACCTTGA GGGAGAGTTC GCGACCTGGC TCGAGACCTG GGACGACGAA
ATCGGAGATT GA
 
Protein sequence
MDRRTFVHGV GGGSVTALAG CLTRNGENEE HAAETGPLRV ATYTSFATGS DSDPDTDSDP 
NPDPDQAPSP AGDWFRETVE EEFEEEIEWT VPESGIEHYI QRARLDADID TDVILGLTAS
ELALVDSVLD AHGDTRLFES LERDRLEHAD RIQSDLAFDD PRDRVLPVGT SYLSLVYDET
VLESPPETFD DLLDSAYADT LLAQDPRVSN PGQAFFLWTV AEYGSGSGML SFWEELQANG
VRIEERWTDA YRDAYLEGER PMVVSYSTDQ VVAAATDRDM QRHQVAPLDN AGYRSTEGAA
IFADATRTEL AYEFVDLLLS QTAQAELATR NAQFPAVSDE YVDLDATFLE NAVEPDETVT
LTYDDLEGEF ATWLETWDDE IGD