Gene Nmag_3369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3369 
Symbol 
ID8826235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp3498818 
End bp3500146 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content63% 
IMG OID 
Productprotein of unknown function DUF444 
Protein accessionYP_003481481 
Protein GI289583015 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0981107 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACTGA GAGACGACCT CGACCGATTC CGTGAGGTTG GCGAACAGCG CCGCGAAGAC 
CTGGCTGACT TCATCCAGTA CGGCGAACTC GGGAGCGGCC GACCCGACCA GATCAACATT
CCGGTCAAGA TCGTCTCGCT GCCCGAGTTC GAGTACGACC AGCGCGACCA GGGCGGCGTC
GGCCAGGGTG AAGACGGCAC ACCGCAACCC GGCCAGCCGG TCGGCCAGCC ACAGCCTCAG
CCGGGCGACG ACGGTGACGA GGACGGCGAG CCGGGCGAGG AGGGCAGCGA GCACGAGTAC
TACGAGATGG ATCCCGAGGA GTTCGCCGAA GAACTCGACG AGGAACTCGG ACTCGACCTC
GAGCCGAAGG GGAAACGCGT CATCGAAGAG AAGGAGGGTC CATTCACCGA CCTCACCCGA
ACCGGTCCGG ACAGCACGCT CGACTTCGAG CGGATGTTCA AGGAGGGCCT CAAGCGCAAA
CTCACGATGG ACTTCGACGA AGACTTCCTC CGCGAACTCT GCAAGGTCGA GGGCATCACC
CCGCGGGACG TGTTCGAGTG GGCCCGCAAC GAGAACATCC CGGTCTCGAT GGCCTGGGTC
GAAGAAGCCT ACAGCGACAT TCCCGAGTCC GAACGGGGCA CGTGGAGTTC GATCTCAGAA
GTCGAAGCAA ACGTCGAACG CGAGAGCGTC CAGCAGCAGA TCCGACGCGA GGGGATCAAG
CACGTCCCCT TCCGCCGCGA GGACGAACGC TACCGCTACC CCGAGATCAT CGAGGAGAAA
GAGAAGAACG TCGTCGTCGT CAACATCCGC GACGTTTCCG GCTCGATGCG CGAGAAGAAA
CGCGAACTCG TCGAGCGCAC CTTCACGCCG CTCGACTGGT ATCTCCAGGG CAAGTACGAC
AACGCCGAGT TCGTCTACAT CGCCCACGAC GCCGAGGCCT GGGAGGTCGA GCGCGACGAC
TTCTTCGGCA TCCGCTCCGG CGGCGGCACG AAGATCTCGA GTGCGTACGA CCTCGCCGCC
GAGCTGTTAG AGGAGTACCC CTGGAGCGAC TGGAACCGCT ACGTCTTCGC GGCGGGCGAC
TCCGAGAACT CCTCGAACGA CACCGAAGAG CGCGTGATCC CGATGATGGA ACAGATCCCC
GCGAACCTCC ACGCCTACGT GGAAACCCAG CCCAGCGGGA ACGCGATCAA CGCGACCCAC
GCCGAGGAAC TCGAGCGCCA CTTCGGCACC GACGCCGACG ACGTGGCGAT CGCGTACGTC
AACGACGCCA ACGACGTGAC CGACGCGATC TACGAGATTC TCAGCACGGA GAGTGAGTCA
GATGAGTAA
 
Protein sequence
MGLRDDLDRF REVGEQRRED LADFIQYGEL GSGRPDQINI PVKIVSLPEF EYDQRDQGGV 
GQGEDGTPQP GQPVGQPQPQ PGDDGDEDGE PGEEGSEHEY YEMDPEEFAE ELDEELGLDL
EPKGKRVIEE KEGPFTDLTR TGPDSTLDFE RMFKEGLKRK LTMDFDEDFL RELCKVEGIT
PRDVFEWARN ENIPVSMAWV EEAYSDIPES ERGTWSSISE VEANVERESV QQQIRREGIK
HVPFRREDER YRYPEIIEEK EKNVVVVNIR DVSGSMREKK RELVERTFTP LDWYLQGKYD
NAEFVYIAHD AEAWEVERDD FFGIRSGGGT KISSAYDLAA ELLEEYPWSD WNRYVFAAGD
SENSSNDTEE RVIPMMEQIP ANLHAYVETQ PSGNAINATH AEELERHFGT DADDVAIAYV
NDANDVTDAI YEILSTESES DE