Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_5081 |
Symbol | |
ID | 8756783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 5304527 |
End bp | 5306881 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 80% |
IMG OID | |
Product | glycoprotein |
Protein accession | YP_003411979 |
Protein GI | 284993424 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0297012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCGGC GGTCGGCTGC CGGGGGCGCC GGCGGCCGGT CCGCCGCGTC TTCCCCCCGC CGCACCGCCC GGGGGACGAC GGTCGGCCTC GCCCTGGCCG GGGCCCTCAC GGCTCCCCTC GTCGTGCTGG CGCCGTCCGC GGCCGCCGCG CCGGAGGACC CCGCTCCCGC CCCGGGGGCC GCCGAGGACC GCCCCGTGAG CGTGGAGCTC ACCCGGCTGG ATCCCCGCAC GCTCGCCCCG GGCGCCCCGG TCACCATCGC CGGCGAGCTG ACCAACACCG GCACGGAGAC GATCGAGGAC CTCGACGTGC GGCTGCAGCG CGGCGGGCCG CTGGCCACCC GCGCCGAGCT GCAGGCGGTC GACGGCGCCC GCGACCTGAC GGCCGTCGTC GCCACGCCGT TCCAGGACGT CGCCGACGAG CTGCGCCCCG GCGCCTCGGC GTCCTTCACC CTCACCACGA CCACCGACGC GCTGGCCATC GACCGGGACG GCGTCTACCC GGTGCTGCTC AACCTCAACG GCGTCGGGTC CGACGGGGAC CGCCGCCGCG TGGGGGAGCT GACCACCTAC CTCGTCCAGC CGTCGGTGCT GCCGCCCACG TCGGCCGGCG TCGCCTGGCT GTGGCCGCTG GTCGAGCGCA CCCACCGCGA CGCGTCCGGC GCCTTCGTCG ACGACGAGCT GACCGGCGAG GTGGCTCCCG GCGGCCGGCT GGACCGGGCG CTGGCCACCG TGGAGCGGCT GCCCGAGACC GTGCCGCCCG GGGGCGGCCA GCCCACGCCC GTGGTCCCGG TGACCCTCGC CGTCGACCCC GCGCTGGTCG AGGAGCTGCA GGTCATGGCC GACGGCCCCT ACGCCGTCGC CGGCGACGAG GGCGCCGGGG CGGGCACCGA CGAGGCCGCC GCCTTCCTGG CGCGGCTGCG CGCCGTCGCC GCCGACCACC CGGTCGTCGC GCTGCCCTAC GGCGACGTCG ACGCCGACGC CCTGACCGCG GCCGGGCTAA CACCGGTCGT CACCCGCAGC CTGCCCGGCA GCCCCGAGGG GACGGCGCAG GACGACCCCG ACGCCCCCAC CGCCCCGCCG ACCGCGGTGC CCACCGGGAC CGGGGGCACC GCCGGCGCGG ACGTCCTCCG GGAGGTCCTG GGAGTGGCCG CGCGGATCGA CCTCGCCTGG GCGGTCGGCG GCTCGCTGCG CCCGGAGACG TTCGGCGTCC TGCAGGACGG CGGCGTCCGG GAGGTCGTGC TGGCCCCCGG CGGGCTCAGC GACGGGGCGG CCGCGCTCGG CCTGGACGGC CCGGCCGCGG CCCGGGCGAC CGTGCCCACC GTCGACGGCT ACGTCGACGC CCTGGTCGCC GACCCCGCGC TCGGCCGGCT CGCCGGCAAC GCGGCGCAGG CCCCGGGGGG CCGCCGCGCC GCCGAGCAGC GCACCGTCGC CGAGCTCGCG CTCCTCACGC TCCAGCCGGG CGACGCGGCC GGAGGGCAGA GCGTGCTGGT GGCCCCGCCC CGCGAGGTCG ACGCCGACCC CGCCGCGGTG AGCGCGATGA TGGCCGCCGC GGCACAGCTG CCGGGGCTGC GGCCGGCGAC CGTGGCCCAG CTCGGCGACG GGCCGGCCAC CGACGCCGGC GAGCTGGTCG CCCCGGCCGA CCCGGGCGGG CTGGAACCGG CCGGCCTGAC CGACGTCACC GCCGCGGTCG CCGTCCGCGA CCAGCTCGCC GGGGCGGTCG TCGGGGACGC CGACGCGGTG CTGGCGCCCT GGGACGCCGC GATCGCGCGC GCCACCTCGG CCGCCTGGCG GGACGACCCC GCGGCGTTCA CCGCGGCCGC CGCGGACCTG CGGACGACGA TGGGCGGGCT GCTCGACCGG GTCACCCTGC TCGCCCCGGC CGCCGGCACC TACAGCCTCG CCTCCAGCGA CGCACCGCTC GTGCTCACGG TCAGCAACGA GCTGCCCTTC GCGCTCCGCG TCCAGCTGCG GGTGCAGACC CGCGGCAACG CGCTGTCGGT GGGCGACCTC GGCGACCAGG TGCTCGGGCC CGAGCAGCGG ACGACGCTGA CGGTCCCGAC CGAGGTGCGC CAGTCCGGCC GCTTCGGCGT CGCTGCCACG CTCACCACCC CGGACGGCGC ACCGCTGGGT GACCCCGTAC AGCTGCAGGT GCAGAGCACC GCGTACGGGT CGATCTCGGT GGTCATCACC ATCGGCGCGG CCGCGCTGCT CGGGCTGCTC TTCCTGCGCC GGCTGGTGCG CTTCCTGCTG CGCCGCCGCC GCGGCACCCC CGACGACGGG GACGGCGACC TGCCGGGCGG CCCCGCACCC GAGGGCGCCG CGGTCCCGCT GCCCCCGACG CGGAGCCCCG TGTGA
|
Protein sequence | MSRRSAAGGA GGRSAASSPR RTARGTTVGL ALAGALTAPL VVLAPSAAAA PEDPAPAPGA AEDRPVSVEL TRLDPRTLAP GAPVTIAGEL TNTGTETIED LDVRLQRGGP LATRAELQAV DGARDLTAVV ATPFQDVADE LRPGASASFT LTTTTDALAI DRDGVYPVLL NLNGVGSDGD RRRVGELTTY LVQPSVLPPT SAGVAWLWPL VERTHRDASG AFVDDELTGE VAPGGRLDRA LATVERLPET VPPGGGQPTP VVPVTLAVDP ALVEELQVMA DGPYAVAGDE GAGAGTDEAA AFLARLRAVA ADHPVVALPY GDVDADALTA AGLTPVVTRS LPGSPEGTAQ DDPDAPTAPP TAVPTGTGGT AGADVLREVL GVAARIDLAW AVGGSLRPET FGVLQDGGVR EVVLAPGGLS DGAAALGLDG PAAARATVPT VDGYVDALVA DPALGRLAGN AAQAPGGRRA AEQRTVAELA LLTLQPGDAA GGQSVLVAPP REVDADPAAV SAMMAAAAQL PGLRPATVAQ LGDGPATDAG ELVAPADPGG LEPAGLTDVT AAVAVRDQLA GAVVGDADAV LAPWDAAIAR ATSAAWRDDP AAFTAAAADL RTTMGGLLDR VTLLAPAAGT YSLASSDAPL VLTVSNELPF ALRVQLRVQT RGNALSVGDL GDQVLGPEQR TTLTVPTEVR QSGRFGVAAT LTTPDGAPLG DPVQLQVQST AYGSISVVIT IGAAALLGLL FLRRLVRFLL RRRRGTPDDG DGDLPGGPAP EGAAVPLPPT RSPV
|
| |