Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gbro_3487 |
Symbol | |
ID | 8552865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gordonia bronchialis DSM 43247 |
Kingdom | Bacteria |
Replicon accession | NC_013441 |
Strand | - |
Start bp | 3701738 |
End bp | 3704722 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | protein of unknown function UPF0182 |
Protein accession | YP_003274573 |
Protein GI | 262203365 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.879068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGTTC GCGGGCCGGC CGGTATGCCG ACACTGTCAC GCAAGAGCAA GATCGCCATC GGGGTCGGCG TCGGCATCCT GGTGCTGTTG CTGGTCGGGC CGCGGCTGGT GTCGATCATC ACCGACTGGC TCTGGTATCG CGACATCGGG TTCACGCAGG TGTTCTCCAC CATCGCCTGG ACGCGGATCA TCTTGTTCCT GGTCACCACG ATCTTCGTCG GCGGCCTCGT CTTCGGCGCC ATCGCGATCG CGTACCGGTC ACGGCCGGTG TTCGTGCCGA CCGCCGGCCC CAACGATCCG CTCGCGCGCT ACCGCACCGC GATCATGGGG CGGTTGCGTT GGTTCGGGAT CGTGCCCGCG GTGATCATCG GCGCGCTCGC CGGTCTCGTC GCCCAGGGGT CGTGGGCCAC GGTCCAGATG TTCCTGCACG GTACCGACTA CGGGACCCAG GATCCGCAGT TCGGTCTGGA CATGGGGTTC TACGCCTTCG ATCTCCCGTT CTATCGCTTC ATCCTGAACC TGCTCTTCGT GGTGGTCGTC ATCGCGTTCA TCGCGAACCT GGTGACCCAC TATCTGTTCG GGGGTTTGCG GCTCGGTGGC GGCGGTGGAT CGCTGACCAC CGCGGCACGG GTCCAGCTGG CCGTCTTGGC CGGAACGTTC CTGCTGCTCA AGGCGATTGC GTACTGGTTC GACCGATACA CGCTGCTCAC CAGCGACCGT AAGGCGGATA TTTTCCCGGG TGCCGGCTAC ACCGACATCA ACGCGGTCCT GCCGTCGAAG CTGATCTTGA TGTCCATCGC GATCATCTGT GCGGTGGCGT TCTTCGCCGG CGTGGTCCTG CGCGACCTGC GTATCCCGGC CCTGGCCACG GTGCTGATGT TGTTCTCGGC GTTGCTGATC GGTGTTGGCT GGCCGCTGGC GATGGAGCAA TTCTCGGTCA AACCGAATGC GGCGCAGAAG GAATCGGAAT ACATCGAGCG GGCGCTCGAC TCCACGCGGC AAGCCTACGG ACTCGGGTCG GATCACGTCT CCTATGAGCG GGACTGGACT GCCCAGCCGG CGAATGCCGC GACCGTCAAC GCGGACACCA ACACGCTGTC CAACATTCGC ATCCTCGACC CGAATGTGGT GTCGCCGACC TTCCGTCAGC AGCAGCAGCG TCGCAACTTC TACGGCTTCC CGACCCAGCT GGCCGTCGAC CGCTATCGGG TGGACGGGCA GTTGCGCGAC TACGTCGTCT CGGTGCGCGA ACTCGACCCG AGCCAGTACC AGGGCAACCA GCAGAACTGG CTCAACAAGC ATCTGGTGTT CACCCACGGC GACGGGTTCG TCGCGGCGCC GGCCAACCGT GTGCGGGAAG CCGCCGACGA CAACGACATC GACAGCGGTG GCGATCCGCT CTACACCGTC ACCGACACCA CCAACGTCAA CGACGAGAAC CATCAGCGCG AGGCCCCGAT CAAGGTCAAG CAGCCGCGCA TCTACTTCGG CGAGCTCATC GCAAAGGTCG ACCCCGACTA CGCGATCGTC GGCTCCGACA ACGGTCAGCC GCGTGAGTTC GACGGCGGTA ACGATGACGG AGCCCGCTAC ACCTACACCG GTGATGCCGG TGTGGGCCTG GGCAATTGGT TCACCCGTGC CCTGTACGCG GTGAAGTTCA CCGAGCGCAA CTTCATCCTG TCCAGTGAGA TCACCAACGC CTCGCGTATC CTCTACAACC GCGACCCGCG GGACCGGGTG AAGAAGGCGG CACCGTGGCT GACCGTCGAC TCCAAGACAT ACCCCGCGGT GATGGCCGAC GGTTCGATCA AGTGGATCGT GGACGGCTAC ACCACTCTTG ACAGCTACCC GTACTCGCAG CCGACGTCGT TGCAGAGTGC GACGGCCGAC GCTCAGGACC TCAACCCGGG ACAGACCGGG CGCAGCCAGG TGAACAAGAC CGTCGGTTAC GTCCGCAACT CGGTGAAGGC CACCGTGGAC GCCTACACCG GCGATGTGGA GCTCTACCAG TTCGACACGC AGGACCCGGT TCTCAAGACC TGGATGAAGG TGTTCCCGGG AACGGTCAAG AGCCGCGCCG ACTTCGACAA GAACACCTCG CTGCGCGAGC ACGTGCGCTA TCCGGAGGAT CTGTTCAAGA TCCAGCGCTA CCTGCTGACC CAGTACCACG TCGATTCGCC GCAGACGTTC TTCCAGGGCA ACGATCGTTG GTCGGTCCCG CCGGACCCCA CCAATTCCGA TGCGGCACAG CGCGGTCTGG ATCAGCCGCC GTACTACTTC GTGGCGGCCA GCCCGGAGAA CGGACAGTCG TCGTTCCAGT TGACGTCGGT GCTCAACCGT CTCGACCGAC CGATCCTCGG TGCCTATGTG ACGGTGTCCT CGGACCCGGA CAACTACGGT CAGATCACCG TGAAGGAGTT GCCGGGCAAC AATCAGCGCT CGGGTCCGGT GCAGGCGTTC AACCCGATGA AGACCGATCC GCGGGTGGCC CAGTCACAGC GTGATCTGCA GGCCACCGCG ACGGTGACCT TCGGCAACCT GCTGACCTTG CCGGTGGGTG ACAACGGGAT CCTGTATGTG GTGCCGATGT ACGCACAGGC CCAGGCGGCC GAGGCGTTCC CGCGCTTGTT CCGGGTGATC ACCCGCTACG AGCCGGCCGG CGGGCAGGCG AGTATCGGCT ACGCCAACAC GACGGCCGAG GCGCTGCGGC AGGTCGGGAT CAATCCCGGC GCGCTCGGCG TACCGAGCGG TCCGACCGAC ACCGATACCG ACAACGGCCA GCCCCAGCCG ACACCGCAAC CGCAGACCCC GCAGGGCCAG CAGCCCAACT CGGCTGCCCG CGACTCCGCG GTGCGGGCGC TCGACCAGGC GTTGCAGAAG CTGCAGCAGG CGCAGACCAG CGGCGACTTC CAGGCCTACG GTGCCGCGCT CGAGGAGCTG AAACGTGCTG TGGCGCAGTA CGAGGCGACC GGGCAGGGCG GCTGA
|
Protein sequence | MSVRGPAGMP TLSRKSKIAI GVGVGILVLL LVGPRLVSII TDWLWYRDIG FTQVFSTIAW TRIILFLVTT IFVGGLVFGA IAIAYRSRPV FVPTAGPNDP LARYRTAIMG RLRWFGIVPA VIIGALAGLV AQGSWATVQM FLHGTDYGTQ DPQFGLDMGF YAFDLPFYRF ILNLLFVVVV IAFIANLVTH YLFGGLRLGG GGGSLTTAAR VQLAVLAGTF LLLKAIAYWF DRYTLLTSDR KADIFPGAGY TDINAVLPSK LILMSIAIIC AVAFFAGVVL RDLRIPALAT VLMLFSALLI GVGWPLAMEQ FSVKPNAAQK ESEYIERALD STRQAYGLGS DHVSYERDWT AQPANAATVN ADTNTLSNIR ILDPNVVSPT FRQQQQRRNF YGFPTQLAVD RYRVDGQLRD YVVSVRELDP SQYQGNQQNW LNKHLVFTHG DGFVAAPANR VREAADDNDI DSGGDPLYTV TDTTNVNDEN HQREAPIKVK QPRIYFGELI AKVDPDYAIV GSDNGQPREF DGGNDDGARY TYTGDAGVGL GNWFTRALYA VKFTERNFIL SSEITNASRI LYNRDPRDRV KKAAPWLTVD SKTYPAVMAD GSIKWIVDGY TTLDSYPYSQ PTSLQSATAD AQDLNPGQTG RSQVNKTVGY VRNSVKATVD AYTGDVELYQ FDTQDPVLKT WMKVFPGTVK SRADFDKNTS LREHVRYPED LFKIQRYLLT QYHVDSPQTF FQGNDRWSVP PDPTNSDAAQ RGLDQPPYYF VAASPENGQS SFQLTSVLNR LDRPILGAYV TVSSDPDNYG QITVKELPGN NQRSGPVQAF NPMKTDPRVA QSQRDLQATA TVTFGNLLTL PVGDNGILYV VPMYAQAQAA EAFPRLFRVI TRYEPAGGQA SIGYANTTAE ALRQVGINPG ALGVPSGPTD TDTDNGQPQP TPQPQTPQGQ QPNSAARDSA VRALDQALQK LQQAQTSGDF QAYGAALEEL KRAVAQYEAT GQGG
|
| |