Gene Snas_0287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_0287 
Symbol 
ID8881466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp299266 
End bp300444 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content65% 
IMG OID 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_003509098 
Protein GI291297820 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGACA CGATCAACGA GGCCCAGCTC GTCGGAGCCG TCGAGCACGA CATCAACGCG 
GACGACTTCC CGATCAAGGG CTGGGACCAC ATCCGGTTCT ACGTGGGCAA CGCCAAACAG
GCCGCGCACT ACTACTCCAC CGCCTTCGGC ATGACCCTGG AGGCATACCG CGGCCCCGAA
CAGGGCTTCC GCGAGCACGC CGAGTACATG CTGCGCTCCG GCGGCGTCCG GTTCGTGCTG
GCCGGTGGCA TCACCGCCGA CTCCGCCGCC ACCAAGCACT ACGCCGCCCA CGGCGACGGC
GTCATCGAGG TGGCCCTCGA AGTGCCCAAC GTGGACGACA ACTACGCCTT CGCGATCAAG
CAGGGCGCCG TGGGCGTCGA GGAACCGCAC GACCTCACCG ACGAGTACGG CACCGTCCGC
GTCGCCGCCA TCGCCACCTA CGGCGAGACC CGGCACGTCC TGGTGGACCG GTCGCGCTAC
AACGGCCCGT TCCTGCCCGG CTACGTCGCC GCCAAGCCCA TTGTGGACCG CACGGCGGCC
ATCAAGGACG GCCGCGAACC CAAGCGCTTC TTCCAGGCCC TCGACCACGT CGTCGGCAAC
GTCGAAGAGG GCAAGATGCT CGACTGGGTC ACCTTCTACC AGAAGGTGAT GGGCTTCACC
AACATCGTCG AGTTCGTCGA CGACGACATC GCCACCGAGT ACTCGGCGCT GATGAGCAAG
GTCGTGGCCA ACGGCACCCG CAAGGTGAAG TTCCCGATCA ACGAGCCCGC CGAGGGCCGC
AAGAAGTCGC AGATCGACGA GTACCTGGAG TTCTACGGCG GCCCGGGCGT GCAGCACATG
GCGCTGGCCA CCAACGACAT CCTGGCCAGT GTGGACGCCA TGCGCGCCAA CGGGGTCGAG
TTCCTCGACG CGCCCGACTC CTATTACGAC GACCCTGAGA TGCGCGAACG CATCGGCACC
GTCCGGGTCC CGATCGAGGA GCTCAAGAAA CGCTCCATTC TGGTCGACCG CGACGAGGAC
GGCTATCTGT TGCAGATCTT CACCAAACCG CAGCAGGACC GACCCAGCGT CTTCTACGAA
CTCATCGAGC GTCATGGATC GCTGAGTTTC GGAAAGGGCA ACTTCAAGGC CCTGTTCGAG
GCCATCGAGA AAGAGCAAGC GAAACGCGGA AACCTTTAG
 
Protein sequence
MVDTINEAQL VGAVEHDINA DDFPIKGWDH IRFYVGNAKQ AAHYYSTAFG MTLEAYRGPE 
QGFREHAEYM LRSGGVRFVL AGGITADSAA TKHYAAHGDG VIEVALEVPN VDDNYAFAIK
QGAVGVEEPH DLTDEYGTVR VAAIATYGET RHVLVDRSRY NGPFLPGYVA AKPIVDRTAA
IKDGREPKRF FQALDHVVGN VEEGKMLDWV TFYQKVMGFT NIVEFVDDDI ATEYSALMSK
VVANGTRKVK FPINEPAEGR KKSQIDEYLE FYGGPGVQHM ALATNDILAS VDAMRANGVE
FLDAPDSYYD DPEMRERIGT VRVPIEELKK RSILVDRDED GYLLQIFTKP QQDRPSVFYE
LIERHGSLSF GKGNFKALFE AIEKEQAKRG NL