Gene Nmag_2477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2477 
Symbol 
ID8825330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2537039 
End bp2538205 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content67% 
IMG OID 
Product3-ketoacyl-CoA thiolase 
Protein accessionYP_003480599 
Protein GI289582133 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGCG TACGCGTCGC CGGAACGGGG TTGACACCGT TCGGAAACGC CCCCGAACGG 
ACGAGCAGAG ACCTCTTTGC CGAAGCGACA CAGACCGCGT TCGAAGAGAG CGGCGTTCCA
CGAGCGGACG TCGAAGCAGT GTTCTACGGA AACTTCATGG GCGAACTGGC AGAGCACCAG
GGCCATCAGG GGCCGCTGAT GGCCGAAGCC GCTGGCGTTC AGGCTCCAGC GACACGGTAC
GAATCGGCCT GCGCCTCGAG CGGGATGGCA GTTCGTGATG CCGTGATGCG CGTCCGAAAC
GGCGAACATG ACGTTGTCCT TGTCGGTGGC GCAGAGCGAA TGACCAACCT CGGCACCGCC
GGCGCGACGG AAGCGCTCGC CATCGCTGCC GACGACCTCT GGGAGGTGCG AGCCGGAATG
ACCTTCCCCG GCGCGTACGC GCTGATGGCC CAGGCGTACT TCGAAGAGTT CGGCGGCGGC
CGCGAGGACC TGGCCAACGT CGCGGTCAAG AACCACGACA ACGCGCTGAC CAACGAGAAG
GCCCAGTACC AGACGGCAAT CTCTGTCGAG GAGGCCCTCG ACGCCCCGAC CGTCTCGAGT
CCACTCGGAC TCTACGACTC GTGTCCCCTC TCGGACGGCG CGGCGGCGCT CGTCCTCACG
AGCGAGGAGT ACGCTGACGA ACACGACCTC GACGCACCGG TTGCCATCAC CGGCACCGGG
CAGGGCGGCG ACCGGATGGC GCTCCACGAC CGCGAGCACC TCGCGCGCTC ACCCGCCGCT
CGTGAGGCCG GTACAGAGGC GTACGCGGAC GCCGGCGTCG ACGCCAGCGA CGTAGACCTC
GCCGAGGTCC ACGACTGCTT TACGATCGCC GAAGTGCTCG CCATCGAGGC GCTCGATCTC
GCCCAGATCG GCGAGGGAAT CTCGGCTGCC CGCGACGGGC GGACGACCGC AGACGGCGAC
GTTCCGATCA ACCTCTCGGG CGGGCTCAAG GCGAAGGGCC ACCCGGTCGG TGCGACCGGC
GCGTCACAGA TTGCCGAGGT CACGAAGCTA CTGGCGGGGA CGCACCCGAA CAGCGAGCAC
GTCGCAGATG CGACGACTGG TCTCGCCCAC AACGCGGGTG GAACGGTCGC CAGTGCGACG
GTTCACGTAC TGGAGGTGAT GGAGTAA
 
Protein sequence
MSGVRVAGTG LTPFGNAPER TSRDLFAEAT QTAFEESGVP RADVEAVFYG NFMGELAEHQ 
GHQGPLMAEA AGVQAPATRY ESACASSGMA VRDAVMRVRN GEHDVVLVGG AERMTNLGTA
GATEALAIAA DDLWEVRAGM TFPGAYALMA QAYFEEFGGG REDLANVAVK NHDNALTNEK
AQYQTAISVE EALDAPTVSS PLGLYDSCPL SDGAAALVLT SEEYADEHDL DAPVAITGTG
QGGDRMALHD REHLARSPAA REAGTEAYAD AGVDASDVDL AEVHDCFTIA EVLAIEALDL
AQIGEGISAA RDGRTTADGD VPINLSGGLK AKGHPVGATG ASQIAEVTKL LAGTHPNSEH
VADATTGLAH NAGGTVASAT VHVLEVME