Gene Sala_1140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1140 
Symbol 
ID4080885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1178013 
End bp1179020 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content68% 
IMG OID638009501 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_616189 
Protein GI103486628 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.777331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCCG CGATTTTCGG CCTGTCCGGG CTGACCCTGA CCGACGATGA ACGAGCCTTT 
TTTCGCGATT GCGCGCCCGC GGGCTATATC CTGTTCGGAC GGAATATCGA AAACCGCGAC
CAGCTCCGCC GCCTGACCGA CGAGTTGCGC AGCCTCGACG GGCGGGCCAA TCTGCCGATT
CTGATCGATC AGGAGGGCGG CCGGGTCGCA CGCATGAAAG AACCCGAATG GCCTGCTTTC
CCCAGCGGGG CGGCCTTTGA CGCGCTTTAT GAGCGCGCGC CCGCCAGCGC GATCGAGGCG
GCGCGGCTCA ATGCCATGGC GCTCGCGGCG ATGCTCGCCG AAGTCGGCAT CACGGTCGAT
TGCCTACCGC TGCTCGACGT GCGCCAGCCG GGCGCGAGCG ACGTGATCGG CGACCGCGCG
CTCGGCAGCG AGCCGATGCG CGTCGCCGCG CTCGGCCGCG CCATTCTGAG CGGTTTGCAG
GCGGGCGGCG TCGTCGGCAT CGTCAAACAT ATCCCCGGCC ACGGCCGCGC GCTGCTCGAC
ACGCACGAAG CCCTGCCGAC GGTCACCGCC TCCGACCGCG AACTGCAGAC CGATCTTGCT
CCGTTCGCGG CGCTCCGCGA TGCGGCGATG GCGATGACCT GTCACGTCAT TTTTGCGGCG
TGGGATCCCG ACCGGCCCGC GACCCTGTCG CCGACCGTCA TCGACAGCGT GATCCGCCAG
CGGATCGGTT TCCATGGGCT GCTGATGACC GACGATCTCG ACATGAAGGC GCTGTCGGGC
AACGTGCCCT CGCGCGCGGC GCAGGCGATC GCGGCAGGGT GCGACATCGC GCTCAATTGC
TGGGCGCGGA TGGATGACAT GATCGGCATC GCCAACCGGC TCGATCCGAT CAGCACGGTC
TCGCGCGCGC GGCTTGAAGG CGCGATGGAC CGGATCGCGG GCGCGCGTGA CGAAGGCGAG
TTCGCCGCGC TCGTCGATCA GCGCGATGCG CTGCTGGCGA TGGTCTGA
 
Protein sequence
MIPAIFGLSG LTLTDDERAF FRDCAPAGYI LFGRNIENRD QLRRLTDELR SLDGRANLPI 
LIDQEGGRVA RMKEPEWPAF PSGAAFDALY ERAPASAIEA ARLNAMALAA MLAEVGITVD
CLPLLDVRQP GASDVIGDRA LGSEPMRVAA LGRAILSGLQ AGGVVGIVKH IPGHGRALLD
THEALPTVTA SDRELQTDLA PFAALRDAAM AMTCHVIFAA WDPDRPATLS PTVIDSVIRQ
RIGFHGLLMT DDLDMKALSG NVPSRAAQAI AAGCDIALNC WARMDDMIGI ANRLDPISTV
SRARLEGAMD RIAGARDEGE FAALVDQRDA LLAMV