Gene Nmag_3975 details

Gene Information

Locus tag: Nmag_3975
Symbol:
ID: 8828709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013924
Strand:
Start bp: 13796
End bp: 15076
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: transposase, IS605 OrfB family
Protein accession: YP_003482073
Protein GI: 289937471
COG category:
COG ID:
TIGRFAM ID:
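
The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the lengths listed) and that the CDS ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(13796, 15076)
print(length_bp)                  # 1281
print(protein_length(length_bp))  # 426
```

Both results match the Gene Length and Protein Length fields of this record.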


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAAAC AGGTCGTCAC CCGCACCTAC ACTGCTTCCA TCAGGAATCA GCAACAGGTG 
TCTGACGACC TCGATTCGCT CGGGTTCTCA GCCTCGAAAC TCTGGAACGT CGGACGGTGG
ACAATCAGTC GAATTTGGGA CGAAATCGGC TACATCCCTG AACACGACGA ACTCACCGCG
TACCTCAAGA GCCACGAACG CTATGATGAC CTGCATTCTC AGTCAAGTCA GCGAGTCCTT
CAAGAACTCG CTGAGGCGTT CAACGGCTGG TACGGCAAAC GACGTAACGG AGACGTGAGC
GCGAACCCGC CCGGCTACCG CAAACACGGC GACGACCACC CGCGTTCAAC GGTCACGTTC
AAAGCCGCTG GCTTCAAACT CGACACCCAG TACGAGCGCG TTCGACTCTC AAAAGGGTCG
AACCTCAAGG AATACTGGTC AGACTTCACC CTCTGCGAAT ACCAGACTTG CCCCGACGTT
GACCTCTCCT CAGTCGAGAG CGTCCAACAA GTACGAGCGG TCTGGACGGG TGACGAGTGG
GAACTGCACT TCGTGTGCAA AGTCGAGGTC GACGTGGCCG ACACACCGGG TGAGAAGACC
GTTGGTGTCG ATCTCGGCAT CAACAACTTT GCAGCACTCG CCTACGAGGA CGGCCACAGC
GAGTTGTACC CGCTGAACTG CCTGAAGCAA GATGACTACT ACTTCAGCAA GCGGATTGCT
CGGTGCGACG ACTCCGACTC TGAGCAGGCC ACCCGGCTGA ACCAGAAAAA GTCTCGACGG
CGTACTCACT ACTTCCACAC ACTCTCCAAG CACATTGTTC GGCGGTGTGT TGCGGAAGGC
GTTGGGACGA TCGTGGTGGG CGATCTCTCC GGAATCCGTG AGGATGAGGA GAACGGTGAG
GCGAAGAACT GGGGGACACA CGGCAACCTC GATCTGCACT CGTGGGCGTT CGACCGCTTC
ACCGACCTGC TCGAATACAA AGCCGAGATG GAGGGAATCT CGATTGAGGA GGTATCCGAG
CAGGATACCT CGAAGTCGTG TTCGTGTTGT GGTCGCAAGC GTGACGCGAA CCGTGTTGAA
CGCGGGTTGT ACGTCTGCGA CCAGTGCGGG ATGACGGCGA ACGCAGATGT GAACGGTGCG
GAGAACATTC GACAGAAAGT ATCTCCGAGT CCCGCCACGG ATGGGGGCGA TAGGAGTAAC
GGCTGGTTGG CACAGCCATC GACGTTCCTG TTTGACAAGG AAACTGGCGT GTTCGCGCCT
CAAGAACAGG TCACGTCGTA A
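
A GC content value such as the one in this record can be recomputed from a sequence printed in 10-base blocks. The sketch below assumes GC content means the fraction of G and C bases over all bases. `gc_fraction` is a hypothetical helper name. The example uses only the first 60 bases of the gene, so its result is not expected to equal the figure reported for the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for display."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above (60 bases).
first_line = "ATGGCGAAAC AGGTCGTCAC CCGCACCTAC ACTGCTTCCA TCAGGAATCA GCAACAGGTG"
print(f"{gc_fraction(first_line):.2f}")  # 0.55 for these 60 bases only
```

Passing the full 1281 bp sequence as one string (spaces and line breaks are ignored) gives the whole-gene value.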
 
Protein sequence
MAKQVVTRTY TASIRNQQQV SDDLDSLGFS ASKLWNVGRW TISRIWDEIG YIPEHDELTA 
YLKSHERYDD LHSQSSQRVL QELAEAFNGW YGKRRNGDVS ANPPGYRKHG DDHPRSTVTF
KAAGFKLDTQ YERVRLSKGS NLKEYWSDFT LCEYQTCPDV DLSSVESVQQ VRAVWTGDEW
ELHFVCKVEV DVADTPGEKT VGVDLGINNF AALAYEDGHS ELYPLNCLKQ DDYYFSKRIA
RCDDSDSEQA TRLNQKKSRR RTHYFHTLSK HIVRRCVAEG VGTIVVGDLS GIREDEENGE
AKNWGTHGNL DLHSWAFDRF TDLLEYKAEM EGISIEEVSE QDTSKSCSCC GRKRDANRVE
RGLYVCDQCG MTANADVNGA ENIRQKVSPS PATDGGDRSN GWLAQPSTFL FDKETGVFAP
QEQVTS
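
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows this on the first 30 bases and on the last line of the gene. It uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code in the start codons it permits, not in amino-acid assignments). `translate` is a hypothetical helper and does not handle ambiguous bases such as N.

```python
from itertools import product

# Standard codon table in TCAG order; amino-acid assignments are the same
# in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGAAAC AGGTCGTCAC CCGCACCTAC"))  # MAKQVVTRTY
print(translate("CAAGAACAGG TCACGTCGTA A"))           # QEQVTS
```

The two outputs match the first ten and last six residues of the protein sequence above. The second call is in frame because the 1260 bases preceding that line are a multiple of three.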