Gene Nmag_0433 details

Gene Information

Locus tag: Nmag_0433
Symbol:
ID: 8823257
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 422211
End bp: 423416
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: protein of unknown function DUF354
Protein accession: YP_003478583
Protein GI: 289580117
COG category:
COG ID:
TIGRFAM ID:
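
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(422211, 423416)
print(length)                  # 1206
print(protein_length(length))  # 401
```

Both values match the Gene Length and Protein Length fields of this record.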


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACATCG TGATCACAAT ACAGCACGCG GCGAACGTGC ACTTCTTCAA GCACGTCGTA 
GGTGAACTCG AGTCGGCGGG CCACGACGTG TTCGTGTTCG CCCGCGAGAA GGGCGTGGTG
GGGGAGTTAC TCGACGCCTA CGAGATCGAC CACGAACTGC TCTGCGGGGA GCCACGAGGC
TGGCTCGGTC TCGGACTGAC GCAGCTCTGT TACGAGGCCC GGCTCCTCCG TCGAGCGCGA
GCGATCGATC CGGACTACAT CCTGACCAGC CACGGTATCG CCGCCACCCA CGTCGGGACA
CTGGTCGGTG CGGAGAGTCA CGTCTACATC GACACCGAGA CGACGATCAA CGGCGGGAAT
CGGCTCACGA TCCCGTTTAC GGACGTGCTC TACACGCCCG AGAGCTTCCG CGAGACGTAC
GACGCCGAGC ACGTCAGGTA TCCCGGCTAC CACGAACTGG CGTACCTCCA CCCCGACCGG
TTCGACCCCG ATCCGGATCG GCTGCGCACC CACGGTGTCG ATCCCGACGA CCGGTACGCC
GTCCTCCGAT TTGGTGCGTG GAACGGCAAT CACGACATCG GAAAGTCCGG GATCTCGGCC
GCTGGGCGAC ACGAAATCGT CGACGAACTC GCCACCGACG GCCGCGTCTT CGTCGCCGAC
GAGGGAGATG GTCCGCTGCC AACCAGCGCG GAACCGTTGC CTGTCCCGCC GGCTGACTTT
CATCACCTGC TCGCGTTCGC CGACCTCGTC GTCGGCGAGG TCGCGACAAC GACACTCGAG
GCCGGCCTCC TCGGAACGCC GACCGTTCGA ATCAGCCCCT TCGCCGGCAC GTCGGAGATG
GGGAAGTTCC GCGAACTCGC AGAGTACGGC CTCGTCCGCT CGTTTCATAC GGATCACGAG
ACGACGGCGA TTCGCGAACT GACGCGACTC TATCGCGATC CGAGCGCCGC GTCGAACTGG
GCAGACAGAC GCGAGGCGCT GCTCGCGACG AAAATCGATG TCACGCAGTA CATTCTCAGC
CAGATTCTCG CGGACGTACC TGAGCCGACG CCCGAGGACC GGGATTCGGG ACCGACGCTG
GCAGCGTCCG GTTCTGGACC CAGCTCCGGT TCCGACGCCG GATCTGAGGC TCTGAATCAC
CCCGGACCAT GTCCAGAAAC AGAATCCCCG CCGAGGCCGG ACCGGCCATC GGTGAACCGG
AACTGA
 
Protein sequence
MDIVITIQHA ANVHFFKHVV GELESAGHDV FVFAREKGVV GELLDAYEID HELLCGEPRG 
WLGLGLTQLC YEARLLRRAR AIDPDYILTS HGIAATHVGT LVGAESHVYI DTETTINGGN
RLTIPFTDVL YTPESFRETY DAEHVRYPGY HELAYLHPDR FDPDPDRLRT HGVDPDDRYA
VLRFGAWNGN HDIGKSGISA AGRHEIVDEL ATDGRVFVAD EGDGPLPTSA EPLPVPPADF
HHLLAFADLV VGEVATTTLE AGLLGTPTVR ISPFAGTSEM GKFRELAEYG LVRSFHTDHE
TTAIRELTRL YRDPSAASNW ADRREALLAT KIDVTQYILS QILADVPEPT PEDRDSGPTL
AASGSGPSSG SDAGSEALNH PGPCPETESP PRPDRPSVNR N
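
Both sequences above are displayed in blocks of 10 characters separated by spaces, so the whitespace has to be removed before the sequences are used elsewhere. The following is a minimal sketch of that cleanup, plus a GC-percentage helper that could be applied to the full gene sequence for comparison with the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as a sample here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short sample.
sample = "ATGGACATCG TGATCACAAT ACAGCACGCG GCGAACGTGC ACTTCTTCAA GCACGTCGTA "
seq = clean_sequence(sample)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```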