Gene Nmag_3803 details

Gene Information

Locus tag: Nmag_3803
Symbol:
ID: 8826673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013923
Strand:
Start bp: 185765
End bp: 187537
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: cytochrome c oxidase subunit I
Protein accession: YP_003481906
Protein GI: 289583496
COG category:
COG ID:
TIGRFAM ID:
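
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names gene_length and protein_length are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the CDS but does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(185765, 187537)
print(length_bp)                  # 1773
print(protein_length(length_bp))  # 590
```

Both results agree with the Gene Length and Protein Length fields in the record.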


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTGATC TCTTCGTCGA CAGATATCCC GATCAGGCAC GGGTCGTCCG TGCAGCGTTT 
ACGACCGCGT ATATTGCTCT AGGAATCGGT GCGCTGTTCG GTGCCTTGCA GGCGTTGCAC
CGGACCGATA TTCTACGACT CGTCGAGTCG ACGACGTACT ATACCATTCT AACCGGACAC
GGCGTCTTTC TGGTCATCTC CTTCACAATC TTCTTTCTCG TAGGGCTTTA CCAGTGGGCG
GTGACGGACA GCCTGAATCG AGGCCCTGTC GACATGCGTC TCACCTGGAC GTGGTACGGG
TTGATGTCGA TTGGGACGTT GTTGGCTGGG ATCGCGATAC TGGCAGGGTT TCTGGACGAT
CCACCATCTG TTCTCGGCTC CGAACTGAGC GCCGACGTTC TCTTTACGTT CTACGCACCG
TTACAGGCGA ACCCGATCTT TTACATTGGG CTCGTCCTGT TCGTCGTTGG AACGTGGCTC
GCCGGCGCCG ACTGGTTCCG AACCTGGTGG GCATGGAAGA AGGAAAACCC GGGCGAACGG
ATCCCGCTTC CAACGTTCAT GGTGTTAACG ACCATGATCA TGTGGTACAT CTCCTCGATC
GGGGTGGCAG TCGCGATCCT CGCGTTTATC CTGCCGTGGT CGCTCGGCCT GATCGATAGT
CTCAACCCCA CGCTAACCCG GACGTTGTTC TGGTACTTCG GCCACCCAGT CGTCTACTTC
TGGCTGTTGC CCGCATATAT GCTGTGGTAC ATCGTCCTGC CAAAGCTCTC GGGCGGCCGG
CTGTTCAGCG ACCCACTGGC ACGCGTCGTC TTCATCCTCT TCGTGTTGCT CTCAACGCCG
GTCGGTATTC ACCACCAGTA CCTTGATCCG GGAATCTCTG AAGGATTCAA ATACATAGTA
ATGACAAATA CCATGTTCCT CCTGTTGCCG AGCCTGTTGA CAGCCTTTAC CGTCGTCGCA
AGTATGGAAC ATGGGGCGCG CCAGCGCGGC GGTGAGGGCA CGTTCAACTG GCTCCGAGCG
CTCCCCTGGC GTGACCCTGC GTTCACGGGA ATGGCCCTCG CCGGTCTCGT CTTTGCATTC
GGCGGATTTA CCGGAATCGT CAACGCCGGC ATGAACATCA ACTATCTCGT CCACAACTCG
CTGTGGGTGC CCGGCCACAT CCACACGCAG GTCGGGACTG CCGTCGCGTT AACGTTCATG
GCCGGATCGT ACTGGTTGAT TCCACAACTG ACCGGCAACC GTCTGGTTGG ACGCCAGCTA
GCACTGGTAC AGGTCGTCCT CTGGTTCGTC GGAATCGTCT TCATGACGAA CTCGATGTAC
CGGGGTGGGC TGATTGGAAT CCCCCGGCGA ACCGCGGAAC CGCAATACTC GTTCGACTAC
GAGATTGCCG TCGGCTCGAT TCCCGAACTC CGCGCCCAGC TTGCCATCGG CGGCATGTTA
CTGTTCATCT CTGCACTGCT CTTTCTGACG ATTATCCTCC TGACTGCGTT CAACAACGAC
AGCATCCCCG TCGTCGACGG GACGATTCCG CCGGCACTGT CCGGTCCCGA GGACTCACCT
CGTATTCTCG ACGACCTTCG GGTATGGATC GCGATTGCAC TGGTCCTCAT CGTTATCGCC
TACGCCTTCC CGCTCGCCAA TATCGTGAGC CGCGGCGGGT TGTTCGGTCC TGACATCGGA
CCCTTCCCCG TCATCGTGGA AACCCTTACG TCTCTTCCGC CAGTCGCCGA CGCATCCATG
TACCTCGAGA CCGTACTCGA CGGTAAGTCC TAG
 
Protein sequence
MSDLFVDRYP DQARVVRAAF TTAYIALGIG ALFGALQALH RTDILRLVES TTYYTILTGH 
GVFLVISFTI FFLVGLYQWA VTDSLNRGPV DMRLTWTWYG LMSIGTLLAG IAILAGFLDD
PPSVLGSELS ADVLFTFYAP LQANPIFYIG LVLFVVGTWL AGADWFRTWW AWKKENPGER
IPLPTFMVLT TMIMWYISSI GVAVAILAFI LPWSLGLIDS LNPTLTRTLF WYFGHPVVYF
WLLPAYMLWY IVLPKLSGGR LFSDPLARVV FILFVLLSTP VGIHHQYLDP GISEGFKYIV
MTNTMFLLLP SLLTAFTVVA SMEHGARQRG GEGTFNWLRA LPWRDPAFTG MALAGLVFAF
GGFTGIVNAG MNINYLVHNS LWVPGHIHTQ VGTAVALTFM AGSYWLIPQL TGNRLVGRQL
ALVQVVLWFV GIVFMTNSMY RGGLIGIPRR TAEPQYSFDY EIAVGSIPEL RAQLAIGGML
LFISALLFLT IILLTAFNND SIPVVDGTIP PALSGPEDSP RILDDLRVWI AIALVLIVIA
YAFPLANIVS RGGLFGPDIG PFPVIVETLT SLPPVADASM YLETVLDGKS