Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_2871 |
Symbol | |
ID | 8754543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 2995643 |
End bp | 2997496 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_003409869 |
Protein GI | 284991315 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.872704 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCTGGG CCCGCTTCGA CGACCTCCGC AGCGGGACGG CGCTCCGGTG TCCCGCGCCC GACCGGATCC TGGTGGCCGA ACACCCGGGT GAAGTCGTCG GCGTGCTGGC CGAGGTCCAG CGGGCCACGG ACTCCGGGCG GTGGGCGTTC GGCTACGTCG CCTATGAGGC CGCTGCCGGG CTGGATCCAC GGCTCGCCGT GCACCGATCC ATGCCCATGG GCATGCCGCT GGTCTGGTTC GGGGTCTGCG ACCAGCCGGT TCCCGTGCCT CCGCTGGAGC CGGCCGGGCC GGCCGGCGCC GGCCGAGGCG GGGCGGCCCG GTGGCAACCC ACGTGGACAC CGGCCGGGCA TGCCGACGGC GTCCGGCAGG TCCACGAGCG GATCGCCGCC GGGGACACGT TCCAGTGCAA CCTGACCGTC CGGATGTCCG GTCGCGTGTC AGGGGATCCC TTCGCCCTGT ACCGGGACCT GGCCCTGGGT CAGCGCGGAG CCCACAGCGC CTATCTCGAC CTCGGCCGCT TCGCCGTGGC CAGTGCCAGC CCCGAGCTGT TCTTCGAGCG CCGTGGCGAC GCGGTGCTGC TCCGCCCCAT GAAGGGCACG GCGCGGCGGG GACGGGACCG GGAGGAGGAC CGGCGCCTGG CCCACCGGCT GCAGTCCAGT CCCAAGGAGC GGGCGGAGAA CGTCATGATC GTCGACCTCA TGCGCAACGA CATCGGCCGG ATCGCCGAGA TCGGCAGCGT CGACGTGCCG GCGCTCTTCA CCGTCGAGCG CTACGAGACC GTGCTCCAGC TCACCTCCGA CGTCACGGCT CGACTCTCGC CTGGAACCGG CCTGGTCGAG CTGTTCCGGG CGCTCTTCCC CTGCGGCTCG GTCACGGGAG CGCCCAAGGC GAGCTCCATG GAGATCATCC GGTCCCTGGA ACCCGACCCG CGCGGCGTCT ACTGCGGAGC CATCGGCCTG GTGGGTCCAC CGGACGCGCC GGTCCGGGCG CGGTTCAACG TGGCCATCCG GACGGCCGTG GTGGACAGGT CCTCGGGAGA GGCCGTGTAC GGCACCGGCG GTGGCATCAC CTGGGGCTCG GAGGCGTCGG CCGAGCATGC CGAGCTCCTC GCCAAGGCCG CGGTCCTGTC CGCGCGGCCC CGGGAGTTCG AGCTGCTGGA GACGATGCGG CACGACCCTG AGCGCGGTCT GCGTAACCGG GAGCGTCATC TGCACCGCCT GGCAGCCTCG GCCGAGCACC TGGGGTTCCG GTTCGACCTG CCGACGGCGC GACGCGTCCT GGCCGTGCGG CTGGCGGGAG AGCCGGCGGC ACGCGTCCGG ATACGCCTCC GGCGCGACGG GACGCTCGCC GTCGACGTCG AGGCGCTCCC TGCGCCCTCG ACCGGCCCGG TGCTGCTCGC GGTCGACGAC GATCCGGTCG ATCCCCGGGA GACCTGGCTC TACCACAAGA CGAGCCTGCG GGAGCCGTAC GACCGGCGCC GCGAGCGGCG ACCCGACGTC GACGACGTGA TCATGGTCAA CACGAGGGGA GAGCTCACCG AGGTGACCCG AGCCTCCCTC GCAGTGGAGC TCGACGGGTG CTGGTGGACG CCTCCACTGG AGGCCGGGTG CCTTCCCGGC GTCGAGCGCG CGCGGCTGCT CGAGATGGAC AGGCTGCAGG AGCGGGTGCT GCGCGTGGCC GACCTCGAGC GGGCGGAGGG GGTGGCGGTG CTCAGCTCGC TGCGGGGATG GCGTGCCGCA GAGTTGAGCG GCGTTCGTCG CAAGGCGGCG CCCATCCGAG GCCGGAAGGA GCCGGTACGG ACCTCCCCCG GGGCGGGGAG CCTGGCGCCT GCGGCACTCA TCGGCGGGGG ATGA
|
Protein sequence | MTWARFDDLR SGTALRCPAP DRILVAEHPG EVVGVLAEVQ RATDSGRWAF GYVAYEAAAG LDPRLAVHRS MPMGMPLVWF GVCDQPVPVP PLEPAGPAGA GRGGAARWQP TWTPAGHADG VRQVHERIAA GDTFQCNLTV RMSGRVSGDP FALYRDLALG QRGAHSAYLD LGRFAVASAS PELFFERRGD AVLLRPMKGT ARRGRDREED RRLAHRLQSS PKERAENVMI VDLMRNDIGR IAEIGSVDVP ALFTVERYET VLQLTSDVTA RLSPGTGLVE LFRALFPCGS VTGAPKASSM EIIRSLEPDP RGVYCGAIGL VGPPDAPVRA RFNVAIRTAV VDRSSGEAVY GTGGGITWGS EASAEHAELL AKAAVLSARP REFELLETMR HDPERGLRNR ERHLHRLAAS AEHLGFRFDL PTARRVLAVR LAGEPAARVR IRLRRDGTLA VDVEALPAPS TGPVLLAVDD DPVDPRETWL YHKTSLREPY DRRRERRPDV DDVIMVNTRG ELTEVTRASL AVELDGCWWT PPLEAGCLPG VERARLLEMD RLQERVLRVA DLERAEGVAV LSSLRGWRAA ELSGVRRKAA PIRGRKEPVR TSPGAGSLAP AALIGGG
|
| |