Gene Nmag_4038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_4038 
Symbol 
ID8828772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013924 
Strand
Start bp79285 
End bp80712 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content61% 
IMG OID 
Productcytochrome bd ubiquinol oxidase subunit I 
Protein accessionYP_003482129 
Protein GI289937527 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0832301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCGAAC CGCTGTTGAT CGACCCCGAA CTGGGGAGTC GCATTCAGTT CGGCGGCACG 
CTTTCGGTCC ACATTGTCTT CGCCGCCCTC TCGGTCGGGC TCGCCCCCTA CATCGTCTAC
TTCACTTACA AGGAAATCTC GACGGGTCGT GAGAAGTACG AACGGTTGCG CTCGTTCTGG
ACGAAAATCT TCGCCATCGG GTTCGTGATG GGCACGGCGA CGGGGATCCC AATGAGCTTC
CAGTTCGGGA CGAACTTTCC CGCGTTCTCG GAGTTCGCCG GTGAACTCAT CGGCGGCCCC
CTCGCCTTCG AGTCGACGAT GGCATTCTTC CTCGAGGCCG TCTTCCTCGG TGTCCTGTTG
TTCGGCCGCG AACGAGTTAG CGACCGCGTC TACGTCCTCT CGTCGGTGCT CGTGATGGTC
GGTGCGTGGC TTTCGGCGCT GTGGATCCTC ATCGTCAACT CCTGGATGCA GACGCCCCAG
GGTTACGAAC TGATCGAGGA AAACGGCGTG ACAGGACTCG TACTCACCGA TCCCATTGCG
GCGTACTTCA CCCCACGGCT GTTCTGGATG TACGTCCACA TGCAGAACGC GGCGGTGATC
TCAGTGACGC TCTTCGTCGC CGGCGTCGCC GCCTACTTCG TCTGGACGAA CCCCGACAGC
GAGCCCTGGC GCGGGACGCT CAAGCTCTCT GTCGGCGTCC TCGCGATCAC CTCGATCTTC
CAGGTGATCC ACGGCGACAT GTACACTCGC CACGTCGTCC AGACGCAGCC GATGAAGTTC
GCCGCCATGG AGGCAATCTA CGAGACCAAA GAGGGCGCCC CGTTGCACCT GCTCGCGTTC
CCGCGCAGTC TCGAAGATAT CACCAATCCG CGGGCCGAGG AACTGTTCAC GGTCAGTATC
CCGTATCTGG CGTCGTTTCT CGCCGAGACC GACCCGACTG GTATCGTCTA CGGCCTCGAG
GAGTTTGACG TACAGAACCC ACCGGTCGCG TACGTCTTCT GGTCGTTCCG GACGATGGTG
TTTCTCGGCT TCTGGTTCAT CTTCCTTGGT CTGTGGGGCG TCTACCGGAT GCGAAAGGGC
GTGCTCTTCG AGCGTGGACG CTACCTCAAA GCGCTCCTGG CCTCGATCCC ACTCGGCTTC
GTCGCGACCA TCGTCGGCTG GTACGTCACC GAGATCGGCC GCCAGCCCTG GATCATCCAG
GACGTCCAGC TTACGAGCGA GGGTGTCTCT CAGACGCTCA CATCGACACA GATGACGATC
TCGCTTTCCG CCTTTGCGAT CGCCTACGCG ATCCTCGTCG TTCTGTTCCT TCGGGTGATC
AAATGGATCG TCGACGGCGA ACTCGAGCGG GTTCTCGAGG ACGACTTCGA ACGGGTCGAG
CAGGAACAGA CCGACGAACG AGCGCCGAGT GGCTCCGGTG AGGTGTGA
 
Protein sequence
MIEPLLIDPE LGSRIQFGGT LSVHIVFAAL SVGLAPYIVY FTYKEISTGR EKYERLRSFW 
TKIFAIGFVM GTATGIPMSF QFGTNFPAFS EFAGELIGGP LAFESTMAFF LEAVFLGVLL
FGRERVSDRV YVLSSVLVMV GAWLSALWIL IVNSWMQTPQ GYELIEENGV TGLVLTDPIA
AYFTPRLFWM YVHMQNAAVI SVTLFVAGVA AYFVWTNPDS EPWRGTLKLS VGVLAITSIF
QVIHGDMYTR HVVQTQPMKF AAMEAIYETK EGAPLHLLAF PRSLEDITNP RAEELFTVSI
PYLASFLAET DPTGIVYGLE EFDVQNPPVA YVFWSFRTMV FLGFWFIFLG LWGVYRMRKG
VLFERGRYLK ALLASIPLGF VATIVGWYVT EIGRQPWIIQ DVQLTSEGVS QTLTSTQMTI
SLSAFAIAYA ILVVLFLRVI KWIVDGELER VLEDDFERVE QEQTDERAPS GSGEV