Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_3279 |
Symbol | |
ID | 8754960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 3449525 |
End bp | 3450835 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | Cupin 4 family protein |
Protein accession | YP_003410255 |
Protein GI | 284991701 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGACCGAGG ACCCCACCGC CCAGGGCCGG CCGGGCGACC GGCCGGCCCT GGGTCGGTGC ACCGCTCTCG ACCCGCGGGT CTTCGCCGAG GAGTACTGGG CCCGCCGGCC GCTGCTGACC CGCGCCGAGG AGACGGGCGG CTCCTTCGCC GACCTGCTCG ACCTCGCCGC GGTCGACGAG CTGCTGTCCC GCCGCGGGCT GCGCACCCCG TTCCTGCGGA TCGCCAAGGA CGGCGCGGTC GTCGACCCCA AGCGCTTCAC CACGTCCGGC GGCGCCGGCG CCGAGGTCGC CGACCAGGTC TCCTCCGACG CGGTGCTGCG GCTGTTCGCC GACGGCAGCA CCGTCGTCCT GCAGGGTTTG CACCGGCTGT GGCCGCCGCT GATCGAGTTC GCCGACCAGC TGGCCGCCGA CCTCGGCCAC CCCACGCAGG TCAACGCCTA CGTCACCCCG CCGTCCTCGC GCGGCTTCTC CCCGCACTAC GACGTCCACG ACGTCTTCGT CCTGCAGGTG GCCGGCGAGA AGCGCTGGCG CATCCACGAG CCGGTGCTGA CCGACCCGCT GCGCACCCAG CCGTGGAACG AGCGGGGCGC CGCGGTGGCC GCCGCCGCCG AGCGCGAGCC GCTGATCGAC GCGGTGCTGC GCCCCGGCGA CGCCCTCTAC CTGCCACGCG GCTACCTGCA CTCGGCGACC GCGCTCGGCG CCATCAGCGC GCACCTGACC GTCGGCATCC ACTCGGTGAC CCGGTGGGCG GCCGCGGAGT CCGCCCTGGA CCTGGTCCGC GTGCTCGCCA CGGAGGACCC GCAGCTGCGC CGCTCGCTGC CGCTGGGCGT CGACCTCGCC GACCCGGCCG CGGTCGCCGA CGACGTCGCG ACGGTCGTCA CCGCGCTGAA GGGCTGGCTG GACCGCGTCG ATCCCGCCGA GGTCGCCGAC CGGCTGCGGG CCCGCACCTG GTCGCAGGTC CGCCCGGAGC CGGTGGCCCC GCTGGCCCAG GCGACCGCGG CGGCCGCCCT CTCCCCCGAC ACCGTGCTCC GGCTGCGCCG CCGCCTGCGC TGCCAGCTGC GCGAGGCCGC CGACGGACGG GTCACCCTGG TCGCGGGACG ACGGTCCCTG GAGCTGCCCG CCGAGACGCG GCCGGCGGTG GCCGGGCTGC TGGCCGCCGG CGAGCTCAAG GTCGCCGACC TGCCCGGGCT CGACCCCGCC GACCAGCTCA CCCTGGGTCG GCGGCTGGTG ACCGAGTCGA TCGCCACCGT GCCCGGTGCG ACCGCGGAGG ACCATGGGGG GCGTGAGCGC GCGCCCGGGA ACGGATCGTG A
|
Protein sequence | MTEDPTAQGR PGDRPALGRC TALDPRVFAE EYWARRPLLT RAEETGGSFA DLLDLAAVDE LLSRRGLRTP FLRIAKDGAV VDPKRFTTSG GAGAEVADQV SSDAVLRLFA DGSTVVLQGL HRLWPPLIEF ADQLAADLGH PTQVNAYVTP PSSRGFSPHY DVHDVFVLQV AGEKRWRIHE PVLTDPLRTQ PWNERGAAVA AAAEREPLID AVLRPGDALY LPRGYLHSAT ALGAISAHLT VGIHSVTRWA AAESALDLVR VLATEDPQLR RSLPLGVDLA DPAAVADDVA TVVTALKGWL DRVDPAEVAD RLRARTWSQV RPEPVAPLAQ ATAAAALSPD TVLRLRRRLR CQLREAADGR VTLVAGRRSL ELPAETRPAV AGLLAAGELK VADLPGLDPA DQLTLGRRLV TESIATVPGA TAEDHGGRER APGNGS
|
| |