Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_1926 |
Symbol | |
ID | 8753597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 1997674 |
End bp | 1998879 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_003409000 |
Protein GI | 284990446 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.127663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTCG AGCAGGCACT GAACGATGAC GAGCGCCTGG CGCAGCTGGA CCTCGATCAG CTGAAGCAGC TCGTCGGGCT GGTGGAGTAC GACGCCTCCG GCGACCCCTT CCCGGTGTCG GGGTGGGACG CGCTGGTGTG GGTCGTGGGA AACGCCACCC AGGCGGCGCA CTTCCACCAG TCGGCCTTCG GCATGGAGCT GGTCGCCTAC TCCGGCCCGG AGACGGGCAA CCGCGACCAC CTGGCCTACG TCCTGGAGTC AGGCGCGGCC CGATTCGTGG TCAGGGGGGC CTACGACCCG GCCAGCCCGC TGGCCGACCA CCACCGCAAG CACGGCGACG GCATCGTCGA CATCGCCCTG TCGGTCCCGG ACGTCGACCG GTGCATCGCG CACGCCGCCG CCCAGGGGGC CACCGTCCTC GAGCAGCCGC ACGACATCAG CGACGAGTTC GGCACCGTCC GGATCGGCGC GATCGCCACC TACGGGGACA CGCGGCACAC CCTGGTCGAC CGCTCCCGCT ACACCGGCCC GTACCTGCCC GGCTACGTCG AGCGCCGCTC CTCCCACGTG AAGCGGGACG GCGCCCCCAA GCGGCTGTTC CAGGCCGTCG ACCACGTCGT CGGCAACGTG GAGCTCGGCG CCATGGACCG GTGGGTCGAG TTCTACAACC GCGTCATGGG CTTCACCAAC ATGGCGGAGT TCGTCGGCGA GGACATCGCC ACGGACTACT CGGCGCTGAT GAGCAAGGTG GTGGCCAACG GCAACCACCG GGTCAAGTTC CCGCTCAACG AGCCGGCGAT CGGCAAGAAG AAGTCGCAGA TCGACGAGTA CCTGGAGTTC TACGGCGGTC CCGGCGCCCA GCACGTCGCC CTGGCCACGA ACGACATCCT GACCACGGTC GACGCGCTGC GCGCCGAGGG CATCGAGTTC CTCGCCACTC CGGACTCCTA CTACGAGGAC CCGGAACTGC GGGCCCGCAT CGGCGAGGTC CGCGCGCCCA TCGAGGAGCT GCAGGAGCGC GGGGTCCTGG TCGACCGCGA TGAGGACGGC TACCTGCTGC AGATCTTCAC CAAGCCGCTC GGCGACCGGC CGACCGTCTT CTTCGAGCTG ATCGAGCGGC ACGGCTCGCT GGGCTTCGGC ATCGGTAACT TCAAGGCGCT GTTCGAGGCG ATCGAGCGGG AGCAGCACAA GCGCGGCAAC TTCTGA
|
Protein sequence | MSLEQALNDD ERLAQLDLDQ LKQLVGLVEY DASGDPFPVS GWDALVWVVG NATQAAHFHQ SAFGMELVAY SGPETGNRDH LAYVLESGAA RFVVRGAYDP ASPLADHHRK HGDGIVDIAL SVPDVDRCIA HAAAQGATVL EQPHDISDEF GTVRIGAIAT YGDTRHTLVD RSRYTGPYLP GYVERRSSHV KRDGAPKRLF QAVDHVVGNV ELGAMDRWVE FYNRVMGFTN MAEFVGEDIA TDYSALMSKV VANGNHRVKF PLNEPAIGKK KSQIDEYLEF YGGPGAQHVA LATNDILTTV DALRAEGIEF LATPDSYYED PELRARIGEV RAPIEELQER GVLVDRDEDG YLLQIFTKPL GDRPTVFFEL IERHGSLGFG IGNFKALFEA IEREQHKRGN F
|
| |