Gene Nmag_2034 details


Gene Information

Locus tag: Nmag_2034
Symbol:
ID: 8824877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 2071958
End bp: 2073694
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: Pyrrolo-quinoline quinone beta-propeller repeat protein
Protein accession: YP_003480166
Protein GI: 289581700
COG category:
COG ID:
TIGRFAM ID:
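
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2071958
END_BP = 2073694


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints "1737 578"
```

Both results agree with the Gene Length and Protein Length fields in the record.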


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCGAAC ACGATCGGCG GCGCGTGCTG AAGCGAACTG GACTCGCAAC AGTCGGTGCG 
ACACTCGCAG CGGCGAGCAC GTCGAGTGCC AGCGCTGAGA CCGGCACCGA CACGACAGGA
GCGTCGACGA CAACCACTCA CTCAGCCGTC GGTGCTGACA CGACTGTCAC CGAGACAGCC
GGCTGGCCAT CGATCGGCGG TACTCCCGGA AACAACCCGG TCGTCGAGCC GGCAGCCGAA
CCCGAGCCGC CAGTATACGT CGCCTGGGAG TACGAACACG CCGGCCCGAC GGCGATCGTC
GACGACACCG TCTATCTCAC CACCGACGGC GAGGTCCACG CCCTCGACGC AGCCGACGGG
GCACTCGAGT GGGCCACCCA CAACATTGGT GCGAGCGGGA CGCCCGCGGT GCGAGGTGAC
ACCGTCCTGG TCGGCGGCGA GCGCCTGACG CTGATCGACG CTGCCGACGG CGAGATTTGC
TGTCAGCACG ATCTCGGCTA CGACGGGGCG CTGGCCTCGC CCGTTGTCGC GGGCGACTAC
GCGTTCACGG TCGGTGACGG GACCCTGTTC GCGTTCGACA TCGACCCGCG CGAGGTCGCC
TGGGAGTTTA CACCAACAGC CGATACTGAC GAGCACGAAC CGCTGTACGA ACAGCCCGTC
GCCGTCGGCG GCGGGGCCGT CGTCGCCGTC AGCGAGAGCC ATGCAGTTGC GTTCGAACTC
GAGGACGGCA CCAAGCGTTG GCGGGTCGAC GACCCCGTCG GTGATGACGA ATACAGCCGG
TTCATGGAAC CGAACCCGCG CCAGACGAGC TACCCGGTCG CGACGGACGA GGTCGTGGCG
ATCGGCAGCG TGGACACGGG CGATGCTTCG ATGTGGCCGC TTGGCTACAC AACGCTGTAC
GACGTGGAGA CCGGCGAGCG GCGGGTCACG AGCGAGCGCT CGACGTTCGA TCCGGGTGCA
ATCACCGACG AGCGGTTCTA CGCCCTTGAC TCGCACAATG TCAGGGGCTA CGACCGGGAC
AGCGGCGAGG AAAGCTGGGA TCCAAGCAGT ATCACGTACC GCGTCCCCTC GATAGCCGTC
GGCGACGGAA TCGTCTATGC AGGACTGACG CTCGACGGGG CTGGATACGA CCCGGACGAG
GACGACGTAC CAGAGCACTA CGACGGCGTG TACGCCTTCG ACGCGGATAC CGGTGAGATC
GAGTGGTCGG TCGGAACGGA CGGCATTCCG CATATCGCAC TCGCGAACGA GACGGTCTAC
GCCAGTTCGG AAACGCTCGT AGCGCTCCGT TCGGAGAACG ACGACTGGCA CGAGGAGGAG
GCGGACACGG CGGACGACGA GGGTGGGGAC GAATCCGACG ATACGACAGA CGAGGCAGCG
GGCGAGGAGG AGAGCGAGGA CACCACCAGT GAGGAGAGTG AGACGGACAC AGACAACGAC
ACTGGCACTG ACACGGACGC TGACACTGGC ACTGACACGG ACGCTGACAC TGGCACTGAC
AACGAGACCG AAAACAACTC GACCGGATCG GCCGACGGAA ACGGCGGAAC CAACGAAACC
GCCGACAAAT CCACCGAATC TGAAGCCGAC TCGAACGACA ACGACAACAA CAAGGACGGC
ACGCCCGGCT TCACCGCCGG CGCGGGCGTC CTCGGTGCTG GAGCGACACT CGAGTGGCTC
CGCCGACGGG CCGGCGGTGG AAGCAATGCG AGCGGTGGTA CTGACCGACG CGAGTAA
 
Protein sequence
MVEHDRRRVL KRTGLATVGA TLAAASTSSA SAETGTDTTG ASTTTTHSAV GADTTVTETA 
GWPSIGGTPG NNPVVEPAAE PEPPVYVAWE YEHAGPTAIV DDTVYLTTDG EVHALDAADG
ALEWATHNIG ASGTPAVRGD TVLVGGERLT LIDAADGEIC CQHDLGYDGA LASPVVAGDY
AFTVGDGTLF AFDIDPREVA WEFTPTADTD EHEPLYEQPV AVGGGAVVAV SESHAVAFEL
EDGTKRWRVD DPVGDDEYSR FMEPNPRQTS YPVATDEVVA IGSVDTGDAS MWPLGYTTLY
DVETGERRVT SERSTFDPGA ITDERFYALD SHNVRGYDRD SGEESWDPSS ITYRVPSIAV
GDGIVYAGLT LDGAGYDPDE DDVPEHYDGV YAFDADTGEI EWSVGTDGIP HIALANETVY
ASSETLVALR SENDDWHEEE ADTADDEGGD ESDDTTDEAA GEEESEDTTS EESETDTDND
TGTDTDADTG TDTDADTGTD NETENNSTGS ADGNGGTNET ADKSTESEAD SNDNDNNKDG
TPGFTAGAGV LGAGATLEWL RRRAGGGSNA SGGTDRRE
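
The sequences above are printed in space-separated groups of ten letters, so they need to be cleaned before any length or composition check. The sketch below shows one way to do that and to compute GC content. The helper names are hypothetical, and the stop codons used (TAA, TAG, TGA) are those of translation table 11 as listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def ends_like_complete_cds(dna: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(dna) % 3 == 0 and dna[-3:] in stop_codons


# First line of the gene sequence block, as printed in the record.
first_line = "ATGGTCGAAC ACGATCGGCG GCGCGTGCTG AAGCGAACTG GACTCGCAAC AGTCGGTGCG"
fragment = clean_sequence(first_line)
print(len(fragment))  # prints "60"
```

Passing the whole gene sequence block through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field, and `len()` of the cleaned string can be compared with the Gene Length field.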