Gene Oter_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOter_2047 
Symbol 
ID6206212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOpitutus terrae PB90-1 
KingdomBacteria 
Replicon accessionNC_010571 
Strand
Start bp2627829 
End bp2628962 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content67% 
IMG OID641691698 
Productarabinogalactan endo-1,4-beta-galactosidase 
Protein accessionYP_001818930 
Protein GI182413864 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3867] Arabinogalactan endo-1,4-beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.074716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.58115 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCAGC GGCCCCCCTT CACCCGATGT GTCCGGCGCG GATTGCTTCT GGCTGGCGCG 
CTCATGCTGG CCACGGTCGG TAGGGCGGCG GCGAGCACGA TGGCGCCCGC AACGTTCCTG
ATCGGCGCCG ACGTCTCGGC GCTCTCCACC CTCGAGCGCC ACGGCGCGGT GTATCGCGAC
AGCGCTGGGC CGCGGGACGC GCTGCAGATT CTCGCGGGCG AGGGTTTCAA CTGCTATCGG
CTGCGTCTGT TTGTGGCTCC GGACGGCAAG GGAATCGTCA CCAACGATCT CGCCTACACG
CTGGCGCTGG CGCGTCGCGT GAAGGCCGCT GGCGCGACGT TAATGCTCGA CCTGCACTAC
TCCGACACGT GGGCCGACCC CGGCAAGCAG TTCAAACCCG CGGCATGGGC GGCGCTCGCG
TTTGACGATC TCGAGCAACA GGTGCGGACG TACACGCGGG AAGTGTTGGA GCGGTTCGCC
CGAGAAGGGC TGATGCCCGA TTACGTCCAA CTCGGCAACG AGATCACGAA CGGGATGCTC
TGGCCCGAGG GCCGGGTGGA GTTTGCCGAA CGAACCAATC GCGCGGGGTG GGAGCACCTC
GGCCGGCTGC TGCGCGCGGC GCATGTGGGA TTGGCGGAGG CGAGCGAAGG CCGGCCGAAG
CCGAAGAGCG TGCTGCACAT CGAGAGTCCG CATCAACGCG AGCGCACGCT CTGGTTCTGC
CGCGAGGCAC GCGCGGCCAA AGTGCCCTTC GACCTGATCG GGATGAGCTA CTATCCGGAA
TGGCACGGCG ACCTCGAGAC GCTCCGCGGG ACGCTGGTCG CGCTCGCGAC GGAGTTTCAT
CAGCCGATCA TCGTGGCCGA GACGGCCTAC CCCTGGACGT CGGATGAGCA CTGGACGGGC
CGGCCGAACC TGAACTGGCC GCTCACGCCC GAAGGCCAAC GCCAGTTTCT GCGCGCTGTG
CTGCAAGTCG TGCGCGAACT GCCCGATGGC TTGGGGCGCG GCGTGTTGTA TTGGCACCCC
GAATCGGTGC TCACGCCCGG CCAGCGGATT TGGCTCGGCG GTTCCTGTGC GCTGTTCGAC
CACGAAGGCA ACGTGCTCCC GGCGGCCCGC TTTGCAGTCC CTAACCAACC CTGA
 
Protein sequence
MLQRPPFTRC VRRGLLLAGA LMLATVGRAA ASTMAPATFL IGADVSALST LERHGAVYRD 
SAGPRDALQI LAGEGFNCYR LRLFVAPDGK GIVTNDLAYT LALARRVKAA GATLMLDLHY
SDTWADPGKQ FKPAAWAALA FDDLEQQVRT YTREVLERFA REGLMPDYVQ LGNEITNGML
WPEGRVEFAE RTNRAGWEHL GRLLRAAHVG LAEASEGRPK PKSVLHIESP HQRERTLWFC
REARAAKVPF DLIGMSYYPE WHGDLETLRG TLVALATEFH QPIIVAETAY PWTSDEHWTG
RPNLNWPLTP EGQRQFLRAV LQVVRELPDG LGRGVLYWHP ESVLTPGQRI WLGGSCALFD
HEGNVLPAAR FAVPNQP