Gene Nmag_4018 details


Gene Information

Locus tag: Nmag_4018
Symbol:
ID: 8828752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013924
Strand:
Start bp: 57566
End bp: 58756
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: L-carnitine dehydratase/bile acid-inducible protein F
Protein accession: YP_003482110
Protein GI: 289937508
COG category:
COG ID:
TIGRFAM ID:
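
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. Neither is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 57566
end_bp = 58756
gene_length_bp = 1191
protein_length_aa = 396

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_length = codons - 1

print(span_bp, remainder, expected_protein_length)  # prints: 1191 0 396
```

Under these assumptions the span equals the listed gene length, and 397 codons minus the stop codon gives the listed protein length of 396 aa.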


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCCGT TAGACGATCT TACCATCGTT GACTTAACCC AATCGATAGC AGGCCCCGTC 
TGTACACAAT TACTCGGGGA AATGGGAGCA ACCGTGATCA AGGTCGAACC CCCGTCAGGC
GATAACTTCC GGAACTTGAT GGGCGGGAGT ATGTTCACCC CGTTCAATCA CGGCAAGCAA
AGCGTTTGCG TCAATATGAA AACCGACGAG GGACACGCAA TCGTTACAGA ACTGGTCGAC
GAAGCCGATA TCGTCGTCGA GAGTTTCCGA CCTGGTGTCC TTGAAAAGTA CGACCTTGAC
TATCAGTCGG TTCGTGAACG AAACGAAGAC ATCATCTATT GTTCACTATC CGGGTTTGGC
CGAACTGGGC CCTACAGTTC GTATCCCGGA TACGACCCAT GTGTCCAGGC ACTCTCCGGT
CTGATGGCAA TCACCGGCTA CCTTGATCGG CCCCCAGTCC GAATCCGCGC GAGCCTGATC
GATTGTGGAA CAGGTGCTAA CGCCGCGTAT GCAATTCTGG CTGCCGTCCG CCAGCGAGAT
CGGGGAGGAT CGGGCACTGA AATCGATATC TCGCTGTTCG ATGTCGCCAT TGCGTGGATG
TCCTACTGGA TCTCGCGATA CGACCGAACC GGAGAACTAC CCGAGCGCGC AGGTGGACAG
GGTATCGGCA GTGCACCAAA TGGTGTCTTT CCGACGGGAG ACGGACACAT CTATCTTGCA
ACCCTGTCTG AAGCGATGTA CGAGCGCCTC TGTCGGTTCC TGGGTCGCGA AGACCTACTC
GAGGACGAGC GGTTCGAGAC AATCGACGAC AGATTGGAAC ACCGTGATCT CATCAAAGAT
GAGTTCACTG CCGAGTTCGA ATCGTACGAT GCGATCGAAC TCGAGAAGGG TCTCATGGAT
GCCGGTGTTC CAACCGGTGC AGTACGAACG GTTAGTGACA TCGTAGACTC GGATCCTCAC
GTTGCTGATC GATCGATGCT CGTCGACTCG TATAATCCAG AAGCTGATGA GGAGGTCGTC
ACGCCTGCAC TCCCGTTCAG ATTCAGTTCA GCGATTCACG ATGGAACGTT CTCCACACAA
CCACCGAAAA AGGGAGAACA CACGGCTGAG ATACTGGAGG CACTCTCCTA TTCCGAAAGT
GAGATTGTGA ATCTGTTTGA TCAAGACGTT GTCTTCGCCG AAGGTCACTG A
 
Protein sequence
MQPLDDLTIV DLTQSIAGPV CTQLLGEMGA TVIKVEPPSG DNFRNLMGGS MFTPFNHGKQ 
SVCVNMKTDE GHAIVTELVD EADIVVESFR PGVLEKYDLD YQSVRERNED IIYCSLSGFG
RTGPYSSYPG YDPCVQALSG LMAITGYLDR PPVRIRASLI DCGTGANAAY AILAAVRQRD
RGGSGTEIDI SLFDVAIAWM SYWISRYDRT GELPERAGGQ GIGSAPNGVF PTGDGHIYLA
TLSEAMYERL CRFLGREDLL EDERFETIDD RLEHRDLIKD EFTAEFESYD AIELEKGLMD
AGVPTGAVRT VSDIVDSDPH VADRSMLVDS YNPEADEEVV TPALPFRFSS AIHDGTFSTQ
PPKKGEHTAE ILEALSYSES EIVNLFDQDV VFAEGH
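
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch using only the Python standard library. The `translate` helper and the `CODON_TABLE` name are hypothetical and do not come from the source page. The sketch relies on translation table 11 assigning the same amino acids to codons as the standard code, with the tables differing only in the permitted start codons.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same codon-to-amino-acid assignments as the
# standard code; only the set of allowed start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, then its last 12 bases (stop included).
print(translate("ATGCAGCCGT TAGACGATCT TACCATCGTT"))  # prints: MQPLDDLTIV
print(translate("GAAGGTCACTG A"))                      # prints: EGH
```

Both results agree with the start (`MQPLDDLTIV`) and end (`...AEGH`) of the protein sequence listed above.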