Gene Nmag_3843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3843 
Symbol 
ID8826713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp232609 
End bp233913 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content59% 
IMG OID 
ProductGlycosyltransferase 28 domain protein 
Protein accessionYP_003481946 
Protein GI289583536 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACG ACAAGACGAT CGCGTTTTTC CCGGAAGCGG CATACGGGCC AGCACTAAAT 
TCCGTCGGCA TCGCACAGGA GTGTCGCGAA CTCGGCCACG AACCCGTCTT CCTCACCGAT
CCACCAATGG CGGAGGTCTT CGAAGACCAC GGCTTCGAGA CGTACGAGGT CAACATGGCG
GATCCGTCAC TGACGGCTGA GGAGAAATCG AAGTACTGGG ACGACTTCAT CAACAAGCAC
ATCCCGAACT TCGATAAGGA GCCCTACGAC CAGCTCGACA ACTACATCAC GGAGTGCTGG
GACGCCATCG TCGAAACCGC GAAGTGGGCA CAGCAGGACT TACCGGACGT ACTGGACGAA
GTCGACCCGG ATCTGATCTG CGTCGACAAC GTCGTTCTGT TCCCGGCTAT CAAACAGTAC
GGCGTCCCCT GGGTTCGAAT CGTCTCCTGC GCAGAAAACG AGATTCCAGA CCCCAATATT
CCGCCGTACC TGTCGGGCTG TCGCGCGGAC GATGTCGAGA GCCACCACGA GTTCGAGCGC
CGGTACGACG AACTGATCGC GCCGGTCCAC GACGACTTCA ACGACTTCCT CAGAGAACAC
GGCGAAGAGC CGTATCCGCA CGGGCTGTTC TTCGAGACGT CCCCATACCT CAACCTCCTC
AAATACCCCG AACGACTGCG CTGGGACCGC TGGAACGAAC TCGACCCAGA CCGGTTCCAG
TACCTGAACG GCTGTCTTCG AGACGAGGAC GAAACCTACG AGGTCCCACC GATCGGCGAC
GAGGACGATC CGCTCGTCTA CCTGAGCTAC GGCAGCCTCG GCTCGGGCGA TACGGACCTG
CTGAAGCGCC TCCTCGAGTT CTTCGGCAGC CAGCCCTACC GCTTCCTCGT GAACGTCGGC
GAATACATCG ACGAGTACGA CGACACACAG ATTCCGGACA ACGTCAAAAT CGATAGCTGG
TTCCCCCAGC AGTCGGCCAT CTCGCAGGCT GACGTCGTTA TTCACCACGG CGGGAACAAC
ACGTTCAACG AGTGTCTCTA CTACGGCAAA CCGGCGATCA TTATGCCGTA CGTCTGGGAC
GGACAGGACA ACGCCACTCG ACTCGACGAG ACGAATCACG GCATCAAACT TCACCGCTCT
GACTGGACGC CCGAGGAATT CGCCGAGGCA CTCGAGACCT GCCTGACTGA CGAGGAGATC
CAGGCGAACG TCGCACAGAC CTCGGCCGAC ATGCAGGCAC AGAGCGGAAC AGAAAAGGCA
GCGCGGCTGC TCGATGACGT ACTGGAGGAT CACGATAATG TCTGA
 
Protein sequence
MSDDKTIAFF PEAAYGPALN SVGIAQECRE LGHEPVFLTD PPMAEVFEDH GFETYEVNMA 
DPSLTAEEKS KYWDDFINKH IPNFDKEPYD QLDNYITECW DAIVETAKWA QQDLPDVLDE
VDPDLICVDN VVLFPAIKQY GVPWVRIVSC AENEIPDPNI PPYLSGCRAD DVESHHEFER
RYDELIAPVH DDFNDFLREH GEEPYPHGLF FETSPYLNLL KYPERLRWDR WNELDPDRFQ
YLNGCLRDED ETYEVPPIGD EDDPLVYLSY GSLGSGDTDL LKRLLEFFGS QPYRFLVNVG
EYIDEYDDTQ IPDNVKIDSW FPQQSAISQA DVVIHHGGNN TFNECLYYGK PAIIMPYVWD
GQDNATRLDE TNHGIKLHRS DWTPEEFAEA LETCLTDEEI QANVAQTSAD MQAQSGTEKA
ARLLDDVLED HDNV