Gene Nmag_1156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1156 
Symbol 
ID8823987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1180320 
End bp1181660 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content67% 
IMG OID 
ProductThreonyl/alanyl tRNA synthetase SAD 
Protein accessionYP_003479302 
Protein GI289580836 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0609715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGGC AACGGGCGGC AGCGGAGCCG TACGCCACGC GGTTCGAGAC CGAAGTGACA 
GCGATCGACG GCCGACGAAT CTGGCTCGAG ACGAGCTACT TCTACGGCGA GAGCGGCGGC
CAACCGGCCG ACCGCGGGAC GATCGACGGG GCCGCAGTCG AGGATGTCCA ACTCGCGGAC
GGCAAGCAGG TCCACGTCAT GGCCGAGGAG CCAACGTTCC GAACTGGCCA GCGCGTCCTC
TGCTCGATCG ACTGGGCGTT CCGGATGTAC TGCATGCGCG CACACACTGC CAGCCACGTA
CTCTACGGAG CCGGCCGCCG GCTCCTCGAC GACCTCGGCT ACGGCGGCTT CGACATCGGC
GAGGAGAAGG TCCGCGTCGA CCTCGAGACG ACCAGTGACC TCGACGACGA GACGCTGCTC
GAACTCGACT CGCTCGTGAA CAAAGCCGTC TGGGAGTCCC GACCCGTCTC CTGGGATGAC
ATCCCGGTTG CCGACGCACG CGAGCGCGAG GATATCGCGT TCAACGAGGC CACCGAGGAC
GGCGCGTTCC AGAAGGGGCG CGTCCGCATC GTCACGATCG GTGGCGCGGA CGAGAACGGC
GGCAACGGTA CACGCGCGCG GAACAGATCT TCTAGCGGGC CGACCGTGAC CACGAGCACC
GACGGCTCAG CCGAACCGTG GGATGTCGCC GCCTGCGGTG GCACACACGT CCGTAACACG
CGCGAAATCG GCCCCGTCAC AGTCCTCGGA CGATCGAACC CCGGCGAAGG CATGACGCGC
GTCGAGTTCA GCGTCGGCCC GACGGCTATC GACCGCCGCC GTGAGGAGAA AGCCACCGCG
CTCACCGCGC GTCAGGAACT CGGCGTTCCC CTCGAGGAGG TCGGCGACGA ACTGACTCGC
CTGCAGGACG AACGCGACAA CCTCTCCGCC GAGATACAAA CACTCCAGCG CGACCTGGTC
GACCAGCAAC TCGAGTCCGC CGACTCGTTC GAACGGGACG GACTCGAGTG GCTGGCCGTC
GCGGTCGGAG GAGAAGACGG AGGCGGCGGC GAGAGCGCAG GTGTCGACGC GAACGATGCG
GGCGAAATCG CGCGCGAAGC TGCGGGAGAG CGTGCGGACG TCGTCGTCAT CGCGGGTGCG
GCTGGCTCAC CGTATGCGGT TGCAAGCGTA GCCGAAGACG CACAGGAGAC GATGTCGGCT
GGCTCGGTGA TCGACGCTCT TACGGCTGAG TTCGGCGGTG GTGGCGGCGG CTCGGATGCG
CTCGCACAGG CCGGTGGCTT CGCCGAGTTG CCAGACGAGG ACGAGATTCG GGACGTACTC
GAGTCGGTCG AGTTCCAGTA G
 
Protein sequence
MSGQRAAAEP YATRFETEVT AIDGRRIWLE TSYFYGESGG QPADRGTIDG AAVEDVQLAD 
GKQVHVMAEE PTFRTGQRVL CSIDWAFRMY CMRAHTASHV LYGAGRRLLD DLGYGGFDIG
EEKVRVDLET TSDLDDETLL ELDSLVNKAV WESRPVSWDD IPVADARERE DIAFNEATED
GAFQKGRVRI VTIGGADENG GNGTRARNRS SSGPTVTTST DGSAEPWDVA ACGGTHVRNT
REIGPVTVLG RSNPGEGMTR VEFSVGPTAI DRRREEKATA LTARQELGVP LEEVGDELTR
LQDERDNLSA EIQTLQRDLV DQQLESADSF ERDGLEWLAV AVGGEDGGGG ESAGVDANDA
GEIAREAAGE RADVVVIAGA AGSPYAVASV AEDAQETMSA GSVIDALTAE FGGGGGGSDA
LAQAGGFAEL PDEDEIRDVL ESVEFQ