Gene Nmag_2872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2872 
Symbol 
ID8825731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2949979 
End bp2952414 
Gene Length2436 bp 
Protein Length811 aa 
Translation table11 
GC content62% 
IMG OID 
ProductFlagella accessory C family protein 
Protein accessionYP_003480987 
Protein GI289582521 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.899166 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACTCT CGATAATCGG CAACATACTC GGAGACGGCG ATGGCAACAG CGGCACTCGC 
TCGAGCTCCG GGAACGATAG CGATGACATC CTCGGTGGTG ACCTCGCCGG CAGTATGAGC
GACGACGAGT TGTTGCCCGG AGCGCGAGCG GAAAGCGCTA CTGGGGGCAA CGGCGGCGAT
GGCACTGGTG ACGAGGCTCT CTTTGACGAT GGCGATCTCG GCGGTGGCGA CGACTTCCTC
GACGACGACT CGATGTCCAT CGACGGCATG GGCGAGATGT CTGACATGGA CGAGATGGGC
GGTATGGGCG AAATGTCCGA TATGGGGTCG ATGGATGGCA TGGACGCCAT GGACGACGGC
GGAACCGTCT CGAGTGAAGT CGAAGCGCGC GTCGAAGAGA TGGAGAACAG CGTCGGCTCG
CTCTCCTCGA CGGTGAACAC CGTCCAGAGC GAGAACGAAA AGATCGGCGA ATCGCTCGAC
GACATCGAGG AGAACATCCG GAAACTCCTC GAGGTCTACG AGATGGTAAC GCAGGGTGTG
AACCCGTTCG TCGAGGGCGA CTCTCTCGCC GATTCGATGG GCGCTGGTGC CGCTGGCAGC
GGTGACTTTG GCGGACAGAG CCTTTTCGAC AGCGGCGACG GCGGCGAAGC GGACGAGACG
ATCGACGAAG ATATCGCGAA CGCCGAGGCG GACGACTTCC TCGACGAGAG TATCATCGAC
GACGATGATG GCTTCGACGA CGACTTCGAC GACCTAGAGG ACGAGACGAG CATGGACGGA
GACGACAACC TCGACCCCGA TGCAGACGCA GACGCCGCTG GTGACGATGA GCTGTCGTTC
GACGAACTGA AATCTGAATA CGAATCCGGT GACGCCAACT GGGACAGCGA CGAAAGCGCT
GCCGACGACG GCGACACCAC CGACGACGAG GACGACCTGG CAGCCGACGA CGATGACGGT
TTCGACGACG ATCTCGCCGT CGACGGCGAC GAAGCGGACG AGACCGACTC GCTCATCGAC
GACGCTGACG CCATTGCGAG CGACGAGACG AGCGATACAG ACGGCGACAC AGCCGACCTC
GCTGCACACG ACCGTTCCCA CCCCGTGTGG GACGACGGCG GCCGCCCGTA CCTCGAAACC
ATCCCCTCCG AGTACGACAC CGAGTTCGTC GTCATGGACT GGCTCGAGTA CCTGGTCGAC
GAACTGGGGC TCAACGGTGC GGCCCGAACA CTCCGGTTCT ACGAGTCAGT CTACTGGGTG
AGCACGTCGG TGGAATCACA CCTACAGACG GTGCTGAACG GCTTTGGTGG CGGACCGGAC
ATCGGCGAGC CAGAACCACA CTCCTCACTC GGCGTCGATC ACAAACGCAG CCTCTGGTGG
ATCAGCCAGA TCGCAACGCC GGAGAAGAAG CGGCGCCCGT TCGACGTGTG GGTCGACGAA
GAAAATATCA CGGTCGAGCA GGCGATGGCT GTCGCGGAAC AACACCAGCC GGAAGTGGGC
GCTGAGGATG GAGGTGACAG TCACGGTCAC GGTCACGATC ACGATCACGA TCACGGTGAA
AACAAAGTCG AACACAACGA CGCCGACCCA GCCGCCGGAA CTGACCACAC AACTGCAGTC
GACGAGTTCG AACCAGTTGC GGCGACAACT ACGACCACGA CCACGACTAC GACTGACGCA
ATCGACACCG CCGATACTGC CACTGGTGAG GAACTCACGT TCGATGAAAC GACAGTGGCC
GACGATCACG CCGAACTCGA GTCACCGACG GAAACGGCTG CGGGAGACAA CAGTCATATC
GACCAGAATC TCGACCCTGA TCCCGACCCC GACCCCGACT ACGACCTAGA CCCAGACTCA
GACCCAGACC CCGACACAAA CTCATCCGAA GACGACATCG AACTCGAGTT CGCCACGGAC
GAACCACTCG ATCCGTTCGC GGCCGAGGAC GACTCGACCG ACCAGACCGA GGGTGACGGC
GAAGCTGACG GCCGGCCGGC TTCGGGTGCG GACACCGATG GGGAAGTAGC GACGACCACA
CCGGCAGCCG ACACCGAACT CGAGTCCAGC GGCGCACAGA AAATCTTCAT CGAGGAGGCA
GAGGAGGACA CGGTGGAGCA GAACCACCGC AACGAACCGG CCGGCGAGCG AGTCGAGCCG
GAAGCGGAGG TGACGGATGG TGGGCAGATG ATCTGGGTCG ATTCGGATGT CGTGCTCTCC
GAGTCCGGTG CCAGACTGTG CAACACGCGC GCGACCACCG GTGGCGTCGA CCGTGGGGAG
CAGGCCGAAA TCGCAAAGCC GCTCGTCGTT TCGGACGAAC CGGCGGATCT CGACGGCTGG
CAGGTTGAAC GGATCAAGCT CCTGCTCGCG CCGGAGGAGT TCGAGGGTGG GTGTGAGCAG
CCAACCGACG AGGCTGCACA CGACCACGAA CAGTAA
 
Protein sequence
MVLSIIGNIL GDGDGNSGTR SSSGNDSDDI LGGDLAGSMS DDELLPGARA ESATGGNGGD 
GTGDEALFDD GDLGGGDDFL DDDSMSIDGM GEMSDMDEMG GMGEMSDMGS MDGMDAMDDG
GTVSSEVEAR VEEMENSVGS LSSTVNTVQS ENEKIGESLD DIEENIRKLL EVYEMVTQGV
NPFVEGDSLA DSMGAGAAGS GDFGGQSLFD SGDGGEADET IDEDIANAEA DDFLDESIID
DDDGFDDDFD DLEDETSMDG DDNLDPDADA DAAGDDELSF DELKSEYESG DANWDSDESA
ADDGDTTDDE DDLAADDDDG FDDDLAVDGD EADETDSLID DADAIASDET SDTDGDTADL
AAHDRSHPVW DDGGRPYLET IPSEYDTEFV VMDWLEYLVD ELGLNGAART LRFYESVYWV
STSVESHLQT VLNGFGGGPD IGEPEPHSSL GVDHKRSLWW ISQIATPEKK RRPFDVWVDE
ENITVEQAMA VAEQHQPEVG AEDGGDSHGH GHDHDHDHGE NKVEHNDADP AAGTDHTTAV
DEFEPVAATT TTTTTTTTDA IDTADTATGE ELTFDETTVA DDHAELESPT ETAAGDNSHI
DQNLDPDPDP DPDYDLDPDS DPDPDTNSSE DDIELEFATD EPLDPFAAED DSTDQTEGDG
EADGRPASGA DTDGEVATTT PAADTELESS GAQKIFIEEA EEDTVEQNHR NEPAGERVEP
EAEVTDGGQM IWVDSDVVLS ESGARLCNTR ATTGGVDRGE QAEIAKPLVV SDEPADLDGW
QVERIKLLLA PEEFEGGCEQ PTDEAAHDHE Q