Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Oter_0119 |
Symbol | |
ID | 6204365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Opitutus terrae PB90-1 |
Kingdom | Bacteria |
Replicon accession | NC_010571 |
Strand | - |
Start bp | 143023 |
End bp | 144120 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641689740 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001817010 |
Protein GI | 182411944 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.432881 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGGC GTGAATTTCT GAAATCCTCC GCCGCCGCGG CGGCGGGCGT CCTTTTGTCC TCTCGCGTGA GCGCGGCGGA GGCGGCGCGG CGGAGCCAGC CGCGGATCCC GCGTTGGCGC GGCTTCAACC TCACGGAACT CGCCAGCGGT CGGCGCGGGC AGGCGTTTGT GGAGTCAGAC TTCGCGTGGA TGGCCGAGTG GGGTTTTGAT TTCGCCCGGC TGCCGATGTC GTACTGGTCG TGGTCCAGTG CCAATGACTG GATGAAAATC GAGGAGGATG CGCTGGCACC GGTCGATCAG GCGATCGAGT GGGCGGCGCG ATACGGCATC CACGTCAACC TGAACTTCCA CCGGATCCCG GGCTACTGCG TGAACGGCCG GGAGCGAGAG CCGTTTCAGC TCTTCGACAG TCCGCACGAT GCGATGGCGC GGGCGCTGGA GGCCGCCGTG CATCACTGGC GTACGTTCGC CCAGCGTTAC CGTGATATCC CGAGTTCGCG GCTCAGCTTC GATTTATTCA ACGAGCCGCC ATTCATGGCG GACCACGACC GCTATGTCGA AGTGGCGCGC GCGCTGGTCG GGGCCATTCG CGAGGTTTCT CCGGACCGGC TGATCTTCGC GGATGGAGCC GACATCGGTC AGACGCCCGT GCCGGGGCTG GTCGAGTTGG GACTGGTGCA AAGCACGCGA GGCTATCTCC CGAAAATGGT CAGCCATTAC ACGGCAACCT GGGTGCCGAA AAACGAGTTC GAGTCATTTG AGACGCCCAC CTGGCCGATG GTCGCGGCGA ATGGCGAGCG CTGGGATCGC GAACGGCTGC GCCGCGAATT GATCGACAAA TGGCAGCCGC TTGTCGCGCA GGGTGTCCCG GTGCACGTAG GCGAATGGGG CTGCTTCATC AAGACCCCGC ACGCCGTGGC GCTGCGGTGG ATGACTGATC TGCTGTCGCT TTGGAAAGAA GCCGGTTGGG GTTGGGCGAT GTGGAATCTG CGCGGCGGCT TCGGAATCGT CGACAGCGGC CGCGCCGACG TGAAGTACGA GAGTTTTCAC GGCCACCAGC TCGACCGTGC GATGCTGGAA TTGCTGCTCG CGCACTAG
|
Protein sequence | MNRREFLKSS AAAAAGVLLS SRVSAAEAAR RSQPRIPRWR GFNLTELASG RRGQAFVESD FAWMAEWGFD FARLPMSYWS WSSANDWMKI EEDALAPVDQ AIEWAARYGI HVNLNFHRIP GYCVNGRERE PFQLFDSPHD AMARALEAAV HHWRTFAQRY RDIPSSRLSF DLFNEPPFMA DHDRYVEVAR ALVGAIREVS PDRLIFADGA DIGQTPVPGL VELGLVQSTR GYLPKMVSHY TATWVPKNEF ESFETPTWPM VAANGERWDR ERLRRELIDK WQPLVAQGVP VHVGEWGCFI KTPHAVALRW MTDLLSLWKE AGWGWAMWNL RGGFGIVDSG RADVKYESFH GHQLDRAMLE LLLAH
|
| |