Gene Nmag_3333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3333 
Symbol 
ID8826198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp3464092 
End bp3465279 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content65% 
IMG OID 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_003481445 
Protein GI289582979 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGCAC TTTCGAATCT GCGCGTGCTG GATCTGACGC AGGTCCTCGC GGGCCCGTAC 
TGTACGATGT TACTCGCGGA CATGGGCGCT GACGTGGTCA AAATCGAACG ACCTGGCGGC
GACCTCATCC GCTCGAATCC GCCGTTCGTC GACGACCCCG AGAAGGAAGC CTACGGCGGC
TACTTCCAGA GCGTCAACCG CGGCAAGCGC AGCATCGAAC TCGACTTCAA CGACGACGAG
GACCGCGCGG ACTTCCTCTC GCTGGTCGAG GAAGCCGACA TCGTCGTCGA GAACTACCGC
GCGGGCACGA TGGAGAAGTA CGACCTGGGC TACGAAACGC TCACCGAGTA CAACCCACAG
CTGATCTACT CCTCGATCCG TGGCTTCGGC GATCCGCGCA CGGGCGAGAC GGACCGACAG
GGCCAGCCCT CCTTCGACCT CATCGCACAG GCGCTCGGCG GCGTCATGGA GACCACCGGC
CAGGAGGACG GCCCGCCGAC GAAGGTCGGC CCCGGTATCG GCGACCTCTT CACGGCCACG
CTGAACTGTA TCGGCATCCT CGCCGCCGTC AACCACCGCG AGCAGACCGG CGAGGGCCAG
TACGTCGACA CCGGGATGTA CGACTCCATG CTCAGCCTGA CCGAGCGCGC CATCTACCAG
CAGTCTTACA CCGGCGAGGC ACCCTCCAGA CGGGGTAACT CCCACCCGAC GCTGTTCCCC
TACGACGCGT TCGAAACCGC GGACGGTCAT ACCGTCATCG CCGCCTTCGG AACGAATCAC
TGGAACGAAG TCTGTGACGC GATGGGCCGC GAGGACCTCG CCGAGGAGTA CCCCACCGCT
GCGGAGCGCC TCGAAAACCG AGAGTCGCTG CGCGAGGAAA TCGCCGACTG GGCCAGCGGA
CTGACCAACG ACGAACTCGT GGGGACACTC GAGGGCCGGG TCCCTGTCGC ACCGGTCCAG
ACCACCGAGG AGATTTTCGA GGACCCGCAC GTCGAAACGC GAGAGATGCT CGTGCCGGTG
GAACAGCCTG GAACGGACGA GGAAGTCGAG ATCGCGGGCT CGCCGATCAA GATGACCGAG
ACGCCGCCGC AGCCACGTGG TCGCGCGCCG TTGCTCGACG AGCACCGGGA GGAGGTGCTC
GGCTCGGATA AGGAAACGAC TGATGTGGAA CAGGCGGCTG ACGACTAG
 
Protein sequence
MGALSNLRVL DLTQVLAGPY CTMLLADMGA DVVKIERPGG DLIRSNPPFV DDPEKEAYGG 
YFQSVNRGKR SIELDFNDDE DRADFLSLVE EADIVVENYR AGTMEKYDLG YETLTEYNPQ
LIYSSIRGFG DPRTGETDRQ GQPSFDLIAQ ALGGVMETTG QEDGPPTKVG PGIGDLFTAT
LNCIGILAAV NHREQTGEGQ YVDTGMYDSM LSLTERAIYQ QSYTGEAPSR RGNSHPTLFP
YDAFETADGH TVIAAFGTNH WNEVCDAMGR EDLAEEYPTA AERLENRESL REEIADWASG
LTNDELVGTL EGRVPVAPVQ TTEEIFEDPH VETREMLVPV EQPGTDEEVE IAGSPIKMTE
TPPQPRGRAP LLDEHREEVL GSDKETTDVE QAADD