Gene TM1040_0905 details


Gene Information

Locus tag: TM1040_0905
Symbol:
ID: 4076275
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 966827
End bp: 967888
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 64%
IMG OID: 638006207
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_612900
Protein GI: 99080746
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
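
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 966827
END_BP = 967888
GENE_LENGTH_BP = 1062
PROTEIN_LENGTH_AA = 353


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the Gene Length and Protein Length fields.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the three fields agree: the interval spans 1062 bp, which is 354 codons, or 353 residues once the stop codon is excluded.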


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGTTCG GGGCGACCAT TCTGGACGCA GAAGGACTGC GTCTCACCGC CGATGAAAAG 
GCGCTGTTTC GCGAAGCAGC GCCTTTTGGC TTTATTCTCT TTGCGCGCAA TATCGAGAGC
GGCGATCAGC TGCGCGCGCT GTGCGACGAG TTTCGCGAAG CCGCCGGGCA TGACTGTCTG
ATTACGATCG ACCAGGAAGG CGGGCGCGTG CAGCGGCTTC GTGCGCCGCT GGCGCGGGAG
TTTCGCCCGG CGCTCGATCA TGTGGAGGCG GCCTCGGATC CGGTGAGGGC GATGTATCTG
CGTGCGCGCC TCATCGCGGC GGAACTGCGC GACTATGGCA TCGACAGCAA CTGTGCGCCG
CTCGCTGATG TGGCAACTGC AGAGACCCAT CCGTTCCTGC GCAATCGCTG TTACGGATCC
AACGTCGAGA GCGTCGCCGC CATTGCCCGC GCCTGTGCCG AAGGACATCT GGATGGTGGC
GTGGTGCCGG TGATGAAGCA CATCCCCGGC CATGGGCGAT CAACGATGGA CAGCCACCAT
GATCTGCCCC ATGTGACCGC CCCCGCGGAC CTTTTGCGCG CGGAGGATTT CGCGACCTTT
GCCAAGCTCC GGGATCTGCC GATGGCGATG ACGGCGCATC TGGTCTACGA CGCCTTTGAT
CCGCGCCCGG CGACGCTTTC GCCGGTCATG ATGGAGATCA TCCGCAATGA GCTGGGCTAT
GATGGTCTGG TGATGACCGA CGACATCTCG ATGAAGGCCC TATGGCAGGA CGCGGCCCAA
GATGAGGCGC GCACCGCTGG CGGGCCATAT CCCGACGACG AAGTGACCTG GAGTGCCCGA
TTGGCGCGTC AGTCCCTTGA TGCCGGCTGT GATGTGGCGC TTTTCTGCAA TTCGTCACTG
GCGGCACGCG CCGAGGTTGT CGCGTCCGCT GGGCAGATGA CTAAGGCCGC TCAACGGCGC
GCCGAGGCGG CCCTGGCCTG CCGCAAGCCC CCTGTCGTGC TTGACACTGA GGCCGTGGAA
GCGGAACTAT CTGCCCTCAT GGGCGGACAG GTTTATGGCT GA
 
Protein sequence
MRFGATILDA EGLRLTADEK ALFREAAPFG FILFARNIES GDQLRALCDE FREAAGHDCL 
ITIDQEGGRV QRLRAPLARE FRPALDHVEA ASDPVRAMYL RARLIAAELR DYGIDSNCAP
LADVATAETH PFLRNRCYGS NVESVAAIAR ACAEGHLDGG VVPVMKHIPG HGRSTMDSHH
DLPHVTAPAD LLRAEDFATF AKLRDLPMAM TAHLVYDAFD PRPATLSPVM MEIIRNELGY
DGLVMTDDIS MKALWQDAAQ DEARTAGGPY PDDEVTWSAR LARQSLDAGC DVALFCNSSL
AARAEVVASA GQMTKAAQRR AEAALACRKP PVVLDTEAVE AELSALMGGQ VYG
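
The sequences above are laid out in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up together with a GC percentage calculation that could be compared with the GC content field; the helper names are hypothetical, and only the first line of the gene sequence is reproduced here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
FIRST_LINE = "ATGCGGTTCG GGGCGACCAT TCTGGACGCA GAAGGACTGC GTCTCACCGC CGATGAAAAG"

joined = clean_sequence(FIRST_LINE)
print(len(joined))           # number of bases on that line
print(gc_percent(joined))    # GC percentage of that line only
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_percent` gives a value to compare with the reported GC content, which the record appears to round to a whole percent.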