Gene Gobs_1926 details


Gene Information

Locus tag: Gobs_1926
Symbol:
ID: 8753597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geodermatophilus obscurus DSM 43160
Kingdom: Bacteria
Replicon accession: NC_013757
Strand:
Start bp: 1997674
End bp: 1998879
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_003409000
Protein GI: 284990446
COG category:
COG ID:
TIGRFAM ID:
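
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `span_length` and `coding_length_aa` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1997674
END_BP = 1998879
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("protein length (aa):", coding_length_aa(GENE_LENGTH_BP))
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.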


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.127663
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCTCG AGCAGGCACT GAACGATGAC GAGCGCCTGG CGCAGCTGGA CCTCGATCAG 
CTGAAGCAGC TCGTCGGGCT GGTGGAGTAC GACGCCTCCG GCGACCCCTT CCCGGTGTCG
GGGTGGGACG CGCTGGTGTG GGTCGTGGGA AACGCCACCC AGGCGGCGCA CTTCCACCAG
TCGGCCTTCG GCATGGAGCT GGTCGCCTAC TCCGGCCCGG AGACGGGCAA CCGCGACCAC
CTGGCCTACG TCCTGGAGTC AGGCGCGGCC CGATTCGTGG TCAGGGGGGC CTACGACCCG
GCCAGCCCGC TGGCCGACCA CCACCGCAAG CACGGCGACG GCATCGTCGA CATCGCCCTG
TCGGTCCCGG ACGTCGACCG GTGCATCGCG CACGCCGCCG CCCAGGGGGC CACCGTCCTC
GAGCAGCCGC ACGACATCAG CGACGAGTTC GGCACCGTCC GGATCGGCGC GATCGCCACC
TACGGGGACA CGCGGCACAC CCTGGTCGAC CGCTCCCGCT ACACCGGCCC GTACCTGCCC
GGCTACGTCG AGCGCCGCTC CTCCCACGTG AAGCGGGACG GCGCCCCCAA GCGGCTGTTC
CAGGCCGTCG ACCACGTCGT CGGCAACGTG GAGCTCGGCG CCATGGACCG GTGGGTCGAG
TTCTACAACC GCGTCATGGG CTTCACCAAC ATGGCGGAGT TCGTCGGCGA GGACATCGCC
ACGGACTACT CGGCGCTGAT GAGCAAGGTG GTGGCCAACG GCAACCACCG GGTCAAGTTC
CCGCTCAACG AGCCGGCGAT CGGCAAGAAG AAGTCGCAGA TCGACGAGTA CCTGGAGTTC
TACGGCGGTC CCGGCGCCCA GCACGTCGCC CTGGCCACGA ACGACATCCT GACCACGGTC
GACGCGCTGC GCGCCGAGGG CATCGAGTTC CTCGCCACTC CGGACTCCTA CTACGAGGAC
CCGGAACTGC GGGCCCGCAT CGGCGAGGTC CGCGCGCCCA TCGAGGAGCT GCAGGAGCGC
GGGGTCCTGG TCGACCGCGA TGAGGACGGC TACCTGCTGC AGATCTTCAC CAAGCCGCTC
GGCGACCGGC CGACCGTCTT CTTCGAGCTG ATCGAGCGGC ACGGCTCGCT GGGCTTCGGC
ATCGGTAACT TCAAGGCGCT GTTCGAGGCG ATCGAGCGGG AGCAGCACAA GCGCGGCAAC
TTCTGA
 
Protein sequence
MSLEQALNDD ERLAQLDLDQ LKQLVGLVEY DASGDPFPVS GWDALVWVVG NATQAAHFHQ 
SAFGMELVAY SGPETGNRDH LAYVLESGAA RFVVRGAYDP ASPLADHHRK HGDGIVDIAL
SVPDVDRCIA HAAAQGATVL EQPHDISDEF GTVRIGAIAT YGDTRHTLVD RSRYTGPYLP
GYVERRSSHV KRDGAPKRLF QAVDHVVGNV ELGAMDRWVE FYNRVMGFTN MAEFVGEDIA
TDYSALMSKV VANGNHRVKF PLNEPAIGKK KSQIDEYLEF YGGPGAQHVA LATNDILTTV
DALRAEGIEF LATPDSYYED PELRARIGEV RAPIEELQER GVLVDRDEDG YLLQIFTKPL
GDRPTVFFEL IERHGSLGFG IGNFKALFEA IEREQHKRGN F
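
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the blocks of ten letters are separated only by whitespace, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_fraction`, and `translate` are illustrative, and only the first and last lines of the gene sequence are embedded to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence from this record.
FIRST_LINE = "ATGAGTCTCG AGCAGGCACT GAACGATGAC GAGCGCCTGG CGCAGCTGGA CCTCGATCAG"
LAST_LINE = "TTCTGA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))
    print(translate(LAST_LINE))
```

Applied to the full 1206 bp sequence, the same `translate` function would be expected to reproduce the protein sequence listed above, and `gc_fraction` could be compared against the GC content field; neither full-length result is asserted here.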