Gene Nmag_4083 details

Gene Information

Locus tag: Nmag_4083
Symbol:
ID: 8828817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013924
Strand:
Start bp: 125535
End bp: 126953
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: Carotenoid oxygenase
Protein accession: YP_003482170
Protein GI: 289937568
COG category:
COG ID:
TIGRFAM ID:
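
The length fields in this record follow from the coordinates by simple arithmetic. The Python sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 125535
END_BP = 126953


def gene_length_bp(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS.

    Assumes the CDS is a whole number of codons and that the last
    codon is a stop codon, which is not counted as a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # prints: 1419 472
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed above.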


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGAACG CAACCGATCG CGCCTACGAA CTCGGGTTTC GAACGGTCGA CACCGAGTAC 
GCCGACCGGC AGCTACCGGT TGAGGGGACC GTGCCGTCGT GGCTCTCCGG GGCGTTGATC
CGGAACGGCC CGGGCCGGTT CGAGTTCGGC GGCAAGCGTG CGACCCACTG GTTCGACGGT
CTGGCGATGC TCCGCCGCTA CGGGTTCGCG GACGGCACCG TTTCGTATAC AAACCGGTTC
CTACGGACCG ACGCGTACGC GGCCGCCGAC ACCGGCCACG GCGCGGCGGA GTTCGCAACG
GGCGACGATT CGTTCCGCCG GCCCCTGCGG TGGCTCCGGT CGCTCGGACC GCCCGAACCG
ACGGACAACG CGACCGTCCA CGTCGCCCAA CTCGGCGAGC ACTTCGTCGC GCTCACCGAG
GCACCGCGGC GGATCGCGTT CGATCCGGTG ACACTCGAGA CCCGCGGCGA ATTTCGCTGG
CGCGACGACA TTCCGGAGCA TCTGGCGACA GCCCACCTCC AGGTCGATCC CAATCGCGAG
GAAACGATCG GCTACAGCAC GGAGTTCGGT CTCTCTCCGA TGTATCACTT CTACCGGATC
CCCAACGGAC GTGCCGGCCG ACGGCACGTC GCTACCGTGC CGGCCGCTGG CCCCGGGTAC
GTCCACGACT GTTCGATCAC CGAGTCACAT ATCGTCATCG TGGAGACGCC GCTCCGGATC
GCGATGGCGA AGGCACTGGT CCCGTGGACC GACGGATTTC TCGACCTTCT CGAGTACGAC
GAGGCGGCCA CGACCAGGTT CATCGTCGTC GATTGGGACA CGGAGAGCCT CGCCGCGACG
CTCGAAACGT CGCCGTTCTT CACGTTCCAC CACGTCAACG CCTACGAGGA CGACGACGAG
TTGGTTCTCG ACCTCGTCGC GTTCGACGAC GACCAGATCG TCCGAGCGCT CACCTTCGAT
GCCCTCTCGG AGGACGGCTT TGCCGCGGCG CCGGACGGTC GGTTTGTGAG ATTCCGGCTC
CATCCCGGCG AGGGACGCGT CCGACGCTCA GAACGATACG ACGGCGGGAT GGAGCTCCCG
ACAGTTCCCA AACCGGTTCG GGGCCGACAG TACCGGTACG CGTATGCACA GGCGACCGAC
AGGAAGGGTG CAAACGGACT GGTCAAACTC GATGTCGAAC GGGGAACCGC GACGGAGTGG
TGGGAGCGCG GCGTCTACGT CGAAGAGCCG CGGATGGTTC GACGACCGGG CGGGACGGCC
GAGGACGACG GCGTCGTGAT CGCGACGGCG CTGGACACCA AACAGGAGCG ATCGATGCTG
CTCGTATTCG ATGCGGAAAC GGTAGTCGAG AGAGCGCGTG CACCTCTGCC ACACGCCGTT
CCGTTCGGCT TCCACGGTCG GTTCTTTCCG GCGGTGTGA
 
Protein sequence
MSNATDRAYE LGFRTVDTEY ADRQLPVEGT VPSWLSGALI RNGPGRFEFG GKRATHWFDG 
LAMLRRYGFA DGTVSYTNRF LRTDAYAAAD TGHGAAEFAT GDDSFRRPLR WLRSLGPPEP
TDNATVHVAQ LGEHFVALTE APRRIAFDPV TLETRGEFRW RDDIPEHLAT AHLQVDPNRE
ETIGYSTEFG LSPMYHFYRI PNGRAGRRHV ATVPAAGPGY VHDCSITESH IVIVETPLRI
AMAKALVPWT DGFLDLLEYD EAATTRFIVV DWDTESLAAT LETSPFFTFH HVNAYEDDDE
LVLDLVAFDD DQIVRALTFD ALSEDGFAAA PDGRFVRFRL HPGEGRVRRS ERYDGGMELP
TVPKPVRGRQ YRYAYAQATD RKGANGLVKL DVERGTATEW WERGVYVEEP RMVRRPGGTA
EDDGVVIATA LDTKQERSML LVFDAETVVE RARAPLPHAV PFGFHGRFFP AV
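
To check the protein sequence against the gene sequence, the gene can be translated codon by codon. The Python sketch below is one way to do this; it is not a tool used by the source database. It builds the codon table from the usual compact NCBI layout (bases in T, C, A, G order), whose amino acid assignments are the same for translation table 11 and the standard code. The helper names `clean`, `translate` and `gc_percent` are hypothetical. The example input is only the first line of the gene sequence above.

```python
from itertools import product

# Amino acid assignments for translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence from its first base up to the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    Alternative start codons (which table 11 reads as M at initiation)
    are not handled; this gene starts with ATG, so it does not matter here.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and first two blocks of the protein sequence.
first_gene_line = "ATGTCGAACG CAACCGATCG CGCCTACGAA CTCGGGTTTC GAACGGTCGA CACCGAGTAC"
first_protein_blocks = "MSNATDRAYE LGFRTVDTEY"

print(translate(first_gene_line) == clean(first_protein_blocks))  # prints: True
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the Protein sequence and GC content fields of this record.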