Gene Snas_6468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_6468 
Symbol 
ID8887694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6821394 
End bp6822578 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content67% 
IMG OID 
Producthomogentisate 12-dioxygenase 
Protein accessionYP_003515176 
Protein GI291303898 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.213717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATTCCTTACT ACCGCGCCGT AGGGGAGCTT CCTCCCAAGC GGCATACCCA GTTTCGTCAG 
CCGGACGGCA GCCTGTACGC CGAGGAACTC GTCGGCCAGG AGGGCTTCTC GTCCGATTCC
TCGTTGCTGT ACCACCGGCA TCTGCCCACC GCGATCTCGG CTTCCACGCA GCACTTCCCG
ACGGCGGCGG CGCCCGAGCC GAACCACCCG CTCAAGCCGC GGCACCTGCG CACCCACAAG
CTGGACGGCG GCGGCGACCC GATCCTGGAC CGACAGCACC TGCTGTCCAA TGCGGACTGC
CGGATCTCGT ACGCGATGTC GACACAGGCC TCGGTGCTGT ACCGCAACAA CCTCGGCGAC
GAGTGTCTGT ACGTCGAGGA CGGCGCCGCG CGGGTGGAGA CGTCGTTCGG CGTCATCGAC
ATCGTCAGCG GTGACTACCT GATCATGCCG ATGTCGACGA TCTACCGGAT CGTGCCCACC
GGTGATACTC CACTGCGGAC TCTGGTGGTG GAGTCGACCG GTCACATCAC GCCGCCCAAG
CGGTACCTGT CGGTGCGGGG CCAGTTCCTG GAGCACTCGC CCTACTGCGA GCGCGACATC
CGGGGGCCGT CGGAGCCGCT CGTGGTCGAC GGCGAGAACG TCGACGTCTA CGTGCAGCAC
CGGGGCCGGG GCGCGGCCAC CATCTGGACC AAGTTCACCT ACGCCCACCA CCCCTTCGAC
GTGGTCGGCT GGGACGGGCA CATGTACCCG TGGGTGTTCT CGATCCACGA CTTCGAGCCG
ATCACCGGGC GGATCCACCA GCCGCCGCCG GTGCACCAGA CCTTCCAGGG GCCGAACTTC
GTGATCTGCT CCTTCGTACC GCGCAAAGTG GACTACCACC CGGACGCGAT CCCGGTGCCG
TACAACCACC ACAACGTCGA CTCCGACGAG GTGCTGTTCT ACACCGGCGG GAACTACGAG
GCCCGCAAGG GTTCCGGCAT CGAGCAGGGT TCGATCTCGC TGCACCCCTC GGGGTTCACC
CACGGCCCGC AGCCGGGTGC CCCCGAGCGC GCCATCGGTG CCGACTACTT CGACGAGCTG
GCCGTCATGG TCGACACCTT CCGGCCGCTC GAACTGTGCG AGGGCGCGCT GGCCTGTGAG
GACTCCGGTT ACGCCTGGAC CTGGAACCGC ACGCCCGACG CGTGA
 
Protein sequence
MPYYRAVGEL PPKRHTQFRQ PDGSLYAEEL VGQEGFSSDS SLLYHRHLPT AISASTQHFP 
TAAAPEPNHP LKPRHLRTHK LDGGGDPILD RQHLLSNADC RISYAMSTQA SVLYRNNLGD
ECLYVEDGAA RVETSFGVID IVSGDYLIMP MSTIYRIVPT GDTPLRTLVV ESTGHITPPK
RYLSVRGQFL EHSPYCERDI RGPSEPLVVD GENVDVYVQH RGRGAATIWT KFTYAHHPFD
VVGWDGHMYP WVFSIHDFEP ITGRIHQPPP VHQTFQGPNF VICSFVPRKV DYHPDAIPVP
YNHHNVDSDE VLFYTGGNYE ARKGSGIEQG SISLHPSGFT HGPQPGAPER AIGADYFDEL
AVMVDTFRPL ELCEGALACE DSGYAWTWNR TPDA