Gene Nmag_2078 details


Gene Information

Locus tag: Nmag_2078
Symbol:
ID: 8824921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 2122421
End bp: 2124235
Gene Length: 1815 bp
Protein Length: 604 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: excinuclease ABC, C subunit
Protein accession: YP_003480209
Protein GI: 289581743
COG category:
COG ID:
TIGRFAM ID:
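
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The record does not state either convention. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2122421
END_BP = 2124235
GENE_LENGTH_BP = 1815
PROTEIN_LENGTH_AA = 604


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 2124235 - 2122421 + 1 = 1815 bp, and 1815 / 3 = 605 codons, one of which is the stop codon.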


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCAG ACGCGGTTCG CGAGCGTGCC CGATCGCTGC CGCGAGAGCC GGGCGTCTAC 
CAGTTCCAGG CCGACGGCAC AACCCTCTAC GTCGGCAAGG CCGTCGACCT CCGGAGTCGC
GTCGGCTCCT ACGCCGATCC GCGAAGCGCC CGCATCAGTC GGATGATCGA CCGCGCAGAC
GCGGTCGAGA TTGCGGTCAC CGACACCGAA ACACAGGCGC TCTTGCTCGA GGCCAACCTG
ATCAAGCGCC ACCAGCCGCG GTACAACGTC CGGCTCAAGG ACGACAAATC GTACCCGATG
GTCCAGTTGA CGGATCACCC CGCCCCGCAG ATCGAGATCA CCCGTGATCC GAGCGAATCC
GCGACCGTCT TCGGGCCCTA CACCAACAAA CGGCAGGTCG AGACCGTCGT GAAGGCCCTG
CGCGAGACCT ACGGCGTCCG CGGCTGTTCG GACCACAAGT ACTCGGGCCG GGACCGTCCC
TGCCTCGACT ACGAGATGGG GCTCTGTACC GCCCCCTGCA CGCGAGAGAT CGAACTCGAG
ACGTACGCCG AAGACGTGAC CGCCGTCGAG CGCTTCTTCG AGGGCGAGAC CGGGATTCTC
ACGGATCCAC TGCGTCGGGA GATGGAAACC GCCGCCCAGG AACAGCACTT CGAGCGCGCG
GCGAATCTGC GGGACCGACT CGAGACGGTC GAAGCCTTCC ACGGTGAGGG CGGCGAGGCG
GTCCAATCGG TCGGCGACGA GCGTGCCGTC GACGTGCTCG GCGTCGCCAT CGAGGGCGAG
GACGCGACGG TGGCTCGACT GCGCGCCGAG AGCGGGAAGC TCGTGGATCG AGAGCGCCAC
ACGCTCGAAG CGCCGGGTGC AGGAACAGAA GCGGGAGCGG AAGCGAAAAC GGCAGCTGCA
GGCGATGGCG GCCTCGCAGC GGGTGCGGAG ACAGCGGACA GCCAGGGCGG CGTCCCCGCC
GTCCTCGCCG CCTTCATCGT TCAGTACTAC GCCGAACGCG ACCTCCCCGA CGCACTCCTC
CTCCCCGAAC GCCACAACGA CGACGAGGTC AGCACCTGGC TCGACGCCGA GGGCGTCGCC
GTCCGCGTCC CCGGTGCCGG TCGCGAGGCC AAACTCGTCG ACCTCGCGCT CAAAAACGCC
CGGCGAAACG TCGGCCGGCG CGACGAGTGT GGCATGCTCG CCGACGCACT CAACCTCGAC
ACGGCCCGCC GAATCGAGGG CTTCGACGTG AGCCATGCCC AGGGAACAGC CGCCGTGGGC
AGCAACGTCA CCTTCGTCGA CGGCAGCGCT GAAAAGGCCG ACTACCGCCG GAAGAAACTC
ACCGACCAGA ACGACGACTA CGACAACATG CGCGCGCTAC TCGAGTGGCG CGCCAGCCGA
GCCGTCGAGG ACCGGGACGA CAGGCCGGAT CCGGATCTCC TGCTGATCGA CGGCGGCGAG
GGGCAACTCG AGGCGGCCCG GGACGCCCTG AGTGCAGTCG GCTGGGACGT ACCTGCAATC
GCGCTGGCAA AGGCCGAGGA GACGGTTATC ATCCCCGATA GACAGCTTTC CTGGCCCTCG
GATGCGCCGC ATCTGCACCT GCTCCAGCGT GTTCGCGACG AGTCTCACCG CTTTGCCGTC
CAGTACCATC AGACGCTGCG TGACGACGTG AGGACGGTGT TAGACGACGT TCCGGGAATC
GGCCCTGAGA CGCGAAAACG GCTGCTCGGT CGCTTCGGGA GTGTGGAGAA CGTTCGCGAG
GCGAGCGTGG AGGACCTACA GAGCGTTTCT GGTGTCGGCG AGAAGACGGC TCGAACGGTG
AAGGAACGGC TGTAG
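
A GC content figure such as the 67% reported for this gene can be recomputed from a nucleotide sequence printed in the 10-base blocks used above. This is a minimal sketch, not the database's own method. The helper name `gc_percent` is hypothetical, and `FIRST_ROW` holds only the first row of the gene sequence, so its value will differ from the whole-gene figure.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above (60 bases), used as a short sample.
FIRST_ROW = "ATGAACGCAG ACGCGGTTCG CGAGCGTGCC CGATCGCTGC CGCGAGAGCC GGGCGTCTAC"

if __name__ == "__main__":
    print(f"GC content of sample row: {gc_percent(FIRST_ROW):.1f}%")
```

Passing the full 1815-base sequence instead of the sample row would give the figure to compare against the record.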
 
Protein sequence
MNADAVRERA RSLPREPGVY QFQADGTTLY VGKAVDLRSR VGSYADPRSA RISRMIDRAD 
AVEIAVTDTE TQALLLEANL IKRHQPRYNV RLKDDKSYPM VQLTDHPAPQ IEITRDPSES
ATVFGPYTNK RQVETVVKAL RETYGVRGCS DHKYSGRDRP CLDYEMGLCT APCTREIELE
TYAEDVTAVE RFFEGETGIL TDPLRREMET AAQEQHFERA ANLRDRLETV EAFHGEGGEA
VQSVGDERAV DVLGVAIEGE DATVARLRAE SGKLVDRERH TLEAPGAGTE AGAEAKTAAA
GDGGLAAGAE TADSQGGVPA VLAAFIVQYY AERDLPDALL LPERHNDDEV STWLDAEGVA
VRVPGAGREA KLVDLALKNA RRNVGRRDEC GMLADALNLD TARRIEGFDV SHAQGTAAVG
SNVTFVDGSA EKADYRRKKL TDQNDDYDNM RALLEWRASR AVEDRDDRPD PDLLLIDGGE
GQLEAARDAL SAVGWDVPAI ALAKAEETVI IPDRQLSWPS DAPHLHLLQR VRDESHRFAV
QYHQTLRDDV RTVLDDVPGI GPETRKRLLG RFGSVENVRE ASVEDLQSVS GVGEKTARTV
KERL
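
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last rows of the gene sequence. It relies on translation table 11 assigning the same amino acids to codons as the standard code, and it does not model table 11's alternative start codons, which is a simplification. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of input."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        residue = CODON_TABLE.get(bases[i:i + 3], "X")  # 'X' for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence above.
GENE_START = "ATGAACGCAG ACGCGGTTCG CGAGCGTGCC CGATCGCTGC CGCGAGAGCC GGGCGTCTAC"
GENE_END = "AAGGAACGGC TGTAG"

if __name__ == "__main__":
    print(translate(GENE_START))  # MNADAVRERARSLPREPGVY
    print(translate(GENE_END))    # KERL
```

The two outputs match the first 20 and the last 4 residues of the protein sequence listed above. The last row is in frame because the 1800 bases before it are a multiple of 3.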