Gene Snas_3557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3557 
Symbol 
ID8884756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3769353 
End bp3770453 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content70% 
IMG OID 
Productglycosyltransferase MGT family 
Protein accessionYP_003512312 
Protein GI291301034 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCC TCTTCCTCAC CGCCGATCTC GGTGGGAACG TGCCGCCGAC GCTCGCCGTG 
GCCGAAGAAC TCACCCGCCG CGGCGTCGAC GTCGAAGTCG CCGGTCTCAA GGACGGACAC
ACCGCGTTCC ACCAACCGCC GTTTCGCGTC GCGGTCGCTG TGGGAGTCAA GGGTCCCGCC
AAATCGCCCG GCGCCATGTT TCGGCTGCTG GCCAGCCGCA GGACCTCGGC GGAAGTGGCG
GAGTTGGTGG CCGAGCGGCG CCCGGACCTG GTGGTCGTCG ACTGCATGCT CCCGGCGCCG
ATCCGGGGCG CGCTGCGCGG CGATGTTCCG GTGGTCGTGC TCTTTCACAC CTTCGGTGCG
TACTGGACCC GGTCGTTCGA CCGTGGTCCC TTCGGCAGAA TCCTCGCCCC GCTTGGACTG
CGGCCGAGTC GGCTGTGGGC CCGGGCCGCC GCCCGGCTGC TGCTGACCGA CGCGGAGTTG
GACCCGGGCC GCGACGACCC CGCGCTCGCG GGCAGCGTAT GGACCGGAAC GACCGAGAAG
GGTGAGCAGC AGCTCCCCCG ACAGGACGGT GCGCGGCCTC GGGTGCTGGT GGCGCTCAGC
TCCACCAACT GGCCGGGCAT GTTGCCGGTC TACCGCAGGA TCGTCGCAGC CCTGTCCGAG
CTGCCGGTCG ACGCGGTGGT GACGACCGGC GGTGTCGACC TGGGCGCAGA GCTGGACGGT
GCGGCCAATG TGGAGATACT TGGTTGGGCC GACCACGGCG CGCTGTTGCC GACAATGGAC
CTCATGATCG GCCACGGCGG ACACTCCTCG ACAATGAAGT CGTTGGCTCA CGGTGTGCCG
CTACTGGTGC TTCCGATCAA CCCCACCGCC GACCAGCGCC TCATCGGCCA GACCCTCACC
GATGCCGGGC TGGGCGCATG GCTCCCGAAG TCCGCCGCCC CCGAGAAGAT CCGCGACGCC
ACCCGGCGGA TCCTCGCCGA CGGCGAACTG CGTGCCCGCA TCGCCGCCAC CGGCGACCGC
TTCCGCGCCC ACACCCCCGG ATCCCAGATC GCCGCCGACG CGCTCATCGC CGTCTCGACA
CGCGACCCAC ATCCACTGTA A
 
Protein sequence
MKILFLTADL GGNVPPTLAV AEELTRRGVD VEVAGLKDGH TAFHQPPFRV AVAVGVKGPA 
KSPGAMFRLL ASRRTSAEVA ELVAERRPDL VVVDCMLPAP IRGALRGDVP VVVLFHTFGA
YWTRSFDRGP FGRILAPLGL RPSRLWARAA ARLLLTDAEL DPGRDDPALA GSVWTGTTEK
GEQQLPRQDG ARPRVLVALS STNWPGMLPV YRRIVAALSE LPVDAVVTTG GVDLGAELDG
AANVEILGWA DHGALLPTMD LMIGHGGHSS TMKSLAHGVP LLVLPINPTA DQRLIGQTLT
DAGLGAWLPK SAAPEKIRDA TRRILADGEL RARIAATGDR FRAHTPGSQI AADALIAVST
RDPHPL