Gene Snas_5252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5252 
Symbol 
ID8886461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5577952 
End bp5579202 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003513979 
Protein GI291302701 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGACA CAGCGATCAG ATCCCAGGGC CTGCCGGTCC GCGCACCGAT CCCGGCGGGC 
CACTACGAAC CGGTCATCGA CGTCGTCATA CCCGTCTACA ACGAACAGGA CGATGTGGAG
GCCAGTGTGC GACGTCTGCA CACCCATCTG GCCCGTACCT TCCCCTACGG CTACCGCATC
ACGGTCGCCG ACAACGCCAG CACCGACGCC ACCCCCGCCA TCGCGGCCCG GCTGGCCGCC
GAGCTGGCGC AGGTGGAGTT CGTCCGGCTG CCCGAGAAGG GCCGCGGCCG GGCGCTGCGC
CAGGTCTGGT CGCACTCGAC GGTGCCGGTG CTGGTGTACA TGGACGTCGA CCTGTCCACT
GACCTCAACG CGCTGTTGCC GCTGGTGGCA CCGCTCATCT CGGGGCATTC GGACCTCGCG
ATCGGCACCC GGCTGGCACG CGGGTCGCGG GTGGTGCGGG GCGGCAAACG CGAGTTCATC
TCCCGCACCT ACAACGCCAT CCTCAAGGGC GGCCTGGCGG CCGGGTTCTC CGACGCGCAG
TGCGGTTTCA AGGCGATCCG CGCCGACGTG GCCGCCGAAC TGCTGCCGCT GGTGGAGGAC
ACCGGCTGGT TCTTCGACAC CGAACTGCTG GTGCTGGCCG AACGCGCGGG ACTGCGCATC
CACGAGGTCC CGGTCGACTG GGTCGACGAC CCCGACAGCC GCGTCGACAT CGTCCGCACC
GCCGTCGACG ACCTCAAGGG AGTGTGGCGG GTGGGCCGGG CGCTGGCGTC GGGGGCGCTG
CCGCTGTCGC GGCTGCGTCG CCCGTTCGGC GACGACCCGC GCGACCGCGA GACCTCGGGC
GGCCTGGTGC GGCAGCTGCT GAGCTTCTGC GTCATCGGGA TACTCAGCAC CCTGTTCTAC
CTGGTGCTGT ACACGGTATT CCGCGACGGA CTCGGGCCAC AGGTATCCAA TATGGTGGCG
CTGTTGGTCA CCGCGATGGC CAGCACGGCC GTCAACCGCC GCTTCACCTT CGGGGTCCGG
GGACGCGACG GTGCCGTCCG GCAGCAGGCG CAGGGGCTCG CGGTGTTCGC GATCGGGCTG
ACTCTCACCA GCGGATCGCT GGCGGCACTG GAAATCGCGA GCCCGACGGC TGGCCAGACC
ACCGAACTGG CCGTACTCGT CGTGGCCAAC CTCGCGGCCT CGCTGCTGAA GTTCCTGCTG
TTTCGCGGTT GGGTCTTCCC GGCCGCCCGT ACCGAAAGTG AGGCGTCATG A
 
Protein sequence
MEDTAIRSQG LPVRAPIPAG HYEPVIDVVI PVYNEQDDVE ASVRRLHTHL ARTFPYGYRI 
TVADNASTDA TPAIAARLAA ELAQVEFVRL PEKGRGRALR QVWSHSTVPV LVYMDVDLST
DLNALLPLVA PLISGHSDLA IGTRLARGSR VVRGGKREFI SRTYNAILKG GLAAGFSDAQ
CGFKAIRADV AAELLPLVED TGWFFDTELL VLAERAGLRI HEVPVDWVDD PDSRVDIVRT
AVDDLKGVWR VGRALASGAL PLSRLRRPFG DDPRDRETSG GLVRQLLSFC VIGILSTLFY
LVLYTVFRDG LGPQVSNMVA LLVTAMASTA VNRRFTFGVR GRDGAVRQQA QGLAVFAIGL
TLTSGSLAAL EIASPTAGQT TELAVLVVAN LAASLLKFLL FRGWVFPAAR TESEAS