Gene Information |
Locus tag | Gbro_4793 |
Symbol | |
ID | 8554175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gordonia bronchialis DSM 43247 |
Kingdom | Bacteria |
Replicon accession | NC_013441 |
Strand | + |
Start bp | 5130146 |
End bp | 5132221 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | protein of unknown function DUF477 |
Protein accession | YP_003275801 |
Protein GI | 262204593 |
COG category | |
COG ID | |
TIGRFAM ID | |
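The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; the helper names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 5130146
END_BP = 5132221


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene Length: {length_bp} bp, Protein Length: {length_aa} aa")
```

Under these assumptions the results agree with the Gene Length (2076 bp) and Protein Length (691 aa) fields in the record.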
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.100382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCATGTCG TTCGGACGTC ACCCCGGCAC GCGGCCCTGA CCGGCCGGAT TCTGCTCGTC GAACTCCTCG GCGCGATGGC CTTGCTGTTC TCGACCCTGC TGTTCTCGAC CCTGCTGTTC TCGACCCTCC TGGGCGCCGG ACCGTCGTCG GCCGAAGCAC CGGTCAACAT GGCGACCCAC GTGGTGGATT CGGCCGACGC ACTCACCCCC GCCCAGGAGG CCGAGGTCAC CGAACGCATC GATCGGCTCT ACCGCGATCA GGGCATTCAG TTGTGGGTGG TCTACGTCCG GGACTTCAAC GGACTCGACC CGGCGCAGTG GGGTAACCGC ACCGCCACGA GCAGCGGCCT CGGCAGCAAG GATGTTCTGC TGTCGGTGGC CACCGACTAC CAGTCCTACG ACCTGTTCCC GATCGACCCC GATGCGGTGG GGCTCGATCA GTCCGACCTC GACACCATCG CCAACGACCT CGTCCTGCCC GCCCTCAAGA GCCGGGACTG GGCGGGTGCG GCAATCGCCG CCGCCGACGG CATCGACGCG GTCAAGAAAC CCTCCTACAC CGGCATCATC ATCGCCGGCG CGGTCGGTGG CGCCGCCGTC ATCGGCGGGG GCGGTGCGCT GGTCTACCGC CGTCGGCGCA AGCGCCGGCA GATCGAGGAT GATCTGTCGC ATCTGCGGGA GAACGAGCTG ACCGTCGACC AGCTCGCCGC ACAGCCCCTC GACGTCCTGC ACCCGTGGTC CCGAGAGGTG CTGACCGACA CCGACAACGC GATCCAGACC AGTGCCGAGG AACTCCGGCT CGCGGTGAGC GAGTTCGGCG AAGCCCAGTG CGCCCCGTTC ACGCAGGCGC TGGACAAGGC CCGCCGCGGA CTGGCCGACT CGTTCGCACT GCGGCAACGC CTCGACGACA ACGTGCCGGA GACCGCCGAC GAACAACGCT CGATGCTCGT GCAGATCATC ACCACCTGCA CCGACGTCGA CAACGACCTC GACGCACAGG TGACCGCCTT CGACGAGATG CGCAACCTGC TGATCAACGC CGACACCCGA TTCGACGAGA TCACCCGGCA GCTCGTCGCA CTGCGGGCCC GACTGGAGCC GGCGGCGTCG AAGCTGGACG CGCTGATCGC CAAACACGGC CGGCAGACCC TGGCGTCGGT CCTGCACAAC GTCGACCTGG CCGGCGACCA GATCGCCTTC GCCGAGGCGA GCGCTGATCA GGGCCGCGCC GCGATCACCG CGCCTGCCGG CCAACAGGGA CCCGCGGTCG CCGACATCCG CTCGGCCGAG GGCTCCATCG AGCAGGCGAG CCGGCTACTC GACGCCATCG ACCACGCCGA CGACAACATC GCCGCCGCGC ATTCGCGGAT GCCCGCGCTC ATCGCCGAGG TGGAGGGCGA GCTGACCGAG GCGGCGGGCC TGTCCACCGA CGGCGGGCCC GGCCTGGCGA CCGCGGTGGC AGCCGCCCAA CAGGCGGTGG CGGCCGCCCG CGACAACTTC GACACCGATC CGCTCGGCGT GTTCACCGCG CTGGTGGACG CCGACGCCGA GCTCGACGAC GCGCTCGACG CGGCCCGTGC GGCCTCCGCC GAACGCACTC GCCGCACCGA GATGGTCACG GCCGCCATCG AATCCGCACA GGCAAAGGTC TCCGCGGCCG ACGATTTCAT CTCGACACGG CGCGGCGCCG TGCAGGCCAC CGCACGCACC CGGCTGGCCG AGGCCCAACG GCTGCTCGCG AGCGCCCAGA CGTCGGCCAC CGGTGACCCC CTCGCGGCCG CCGATGCCGC ACGACGGGCG 
GGCGCGCTGG CCGATCAGGC ACTGATGGCC GCGCAGGGCG ATGTGGTCGG CTGGCAGCAG ACCCAGCAGC CCCGCTCCGA CGGCGCGTCG GCAGCCGGCG CGGTCCTCGG CGGCATCCTC GTCGACAGCT TCCTGCGCGG CACGATGGCC GGCCGCGGTC ACGGTGGTGG TTTCGGGGGT GGGTTCGGCG GCGGTTTCGG CAGCGGTGGC CGCAGTCCCG GTTCCTTCGG CGGGTCGGGC AGTTCCGGAC GAATCGGGGT CGGCGGCCGG TTCTGA
|
Protein sequence | MHVVRTSPRH AALTGRILLV ELLGAMALLF STLLFSTLLF STLLGAGPSS AEAPVNMATH VVDSADALTP AQEAEVTERI DRLYRDQGIQ LWVVYVRDFN GLDPAQWGNR TATSSGLGSK DVLLSVATDY QSYDLFPIDP DAVGLDQSDL DTIANDLVLP ALKSRDWAGA AIAAADGIDA VKKPSYTGII IAGAVGGAAV IGGGGALVYR RRRKRRQIED DLSHLRENEL TVDQLAAQPL DVLHPWSREV LTDTDNAIQT SAEELRLAVS EFGEAQCAPF TQALDKARRG LADSFALRQR LDDNVPETAD EQRSMLVQII TTCTDVDNDL DAQVTAFDEM RNLLINADTR FDEITRQLVA LRARLEPAAS KLDALIAKHG RQTLASVLHN VDLAGDQIAF AEASADQGRA AITAPAGQQG PAVADIRSAE GSIEQASRLL DAIDHADDNI AAAHSRMPAL IAEVEGELTE AAGLSTDGGP GLATAVAAAQ QAVAAARDNF DTDPLGVFTA LVDADAELDD ALDAARAASA ERTRRTEMVT AAIESAQAKV SAADDFISTR RGAVQATART RLAEAQRLLA SAQTSATGDP LAAADAARRA GALADQALMA AQGDVVGWQQ TQQPRSDGAS AAGAVLGGIL VDSFLRGTMA GRGHGGGFGG GFGGGFGSGG RSPGSFGGSG SSGRIGVGGR F
|
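The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch and not the database's own pipeline: the function names are hypothetical, the codon table is built from the NCBI amino acid string for translation table 11 (which assigns the same amino acids as the standard code), and alternative start codons are not treated specially, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acid string for translation table 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last codons of the gene sequence in the record.
print(translate("ATGCATGTCG TTCGGACGTC A"))
print(translate("GGCGGCCGG TTCTGA"))
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the Protein sequence and GC content fields.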