Gene Snas_2963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_2963 
Symbol 
ID8884162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3125966 
End bp3128329 
Gene Length2364 bp 
Protein Length787 aa 
Translation table11 
GC content69% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003511731 
Protein GI291300453 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0080512 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000919293 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCACGG AACTGCACCA CCCGACGCGA CAGCGCACCG CGGTCGCCGC CGCCATCGAC 
GGCGTCGCCG TCGTCATGCC CGCCTACGGC GAGGAGGCCA ACCTCGCCGC GACCGTCACC
GACTTCCTGG ACACACTGGA CGACGCCAGG ATCCCGCACG CCGTGGTCGT GGTCAACGAC
GGCAGCACCG ACCGCACCGG CGACGTCCTC GACTCACTGG CCAGACGCTA TCCCGGCCGG
GTCGTGGCCG TGCACCACGA CCGCAACCGC GGCTACGGCG CGGCGGTGCG TTCCGGCCTG
GACGCCGCGT TGCGGCACAC CGAACTGGGC CAGATCCTGC TCACCGACTC CGACCGGCAG
TTCCACGCCG CCGATCTGCT GGAGTTGCGA CGCCGCAAGG CCACCGAACG CGCCGACGCC
ATCCTGGGTT ACCGGGAGCG CCGCGCCGAC CCGTGGCATC GCCGCCTCAA CGCCCGAGTG
TGGACACTGC TGTGCAAGAC GCTGCTGCGG CTGCCCGGCC GCGACGTCGA CTGCGCCTAC
AAACTCATCG ACCGCCGCCT GCTGGAGGAA CTGAGCCTCA CCGGTGAGGC CGCGGCGATC
TCACCCGAAC TGGTCAGCCG GATCACCGTG GACGGCAACC GGGTTGTGGA ACACCCGGTG
CGGCACTATC CGCGCACCCA CGGCGAACAG ACCGGCGCGC GGCTGTCGGT GGTGGCGCGG
TCGCTGCTGA GCCTGGCCGG GGTCTACGCC AGACTCGTCC GCGACGCCCA CCGGCTGGTG
TGGCTGCGAC GGCTGGCACG TCCGGCCAAC CCGACCGCCG CCGTCGTCAC CATCCTGGCG
GCACTGCTGG CGGCCGCCGC CTATGTGTTC TATCTGGACC AGGGCGTTTT ACTGGCCTAC
AAGGACGCCA ACTCGCACCT GCTGATCGCG CGGCGCGTCG TGGCCAGTCC CACCGCCGGA
CTGGCCCAAC TGGGCGGGGT GTGGCTGCCG CTGCCGCACC TGCTGTCGGC GCCGCTGGCG
GCCTGGGAAT CCTGGTACCT CAACGGTTTC GCCGGGGCCC TGATCTCGAT GATCTCGTTC
GTGCTGTGCG TGCGCTACCT GTACCTGCTG GGGGAGACGC TGACCCGCAC CCGGCTGGGC
GGCCTGGTCA CGGCGGCGCT GTTCGCGGGC AACGCCAACA TCCTGTACCT CCAGGCCACC
GCGATGACCG AACTGCCGCT GTTCGCGTGC CTGGCCGCCG CGACTTACCA CCTCGACCAA
TGGTGCCGGG CCGCCCGCAG CCGAGACCTG GCGGCCGCCT CGGTGGCGGT GCTGGCCGCG
ACCCTGACCC GCTACGAGGG CTGGGTGTTC TGCGCCGCGG CACTGGTGAT CGTGATCTAT
GTGGAGCTGC GGCGGCACCG CAGCTACTCC AGGACCGAGT CCTCACTGGT CATCTTCGGA
TTCCTGGCCT GCGCGGGAAT CGCGGCCTGG CTGGTGTGGA ACCTGGTGAT CTTCGCCGAC
CCGCTGTACT GGCAGCGCGG CGAGTACGCC GAGTCCTCCC TGTGGGTGGA CGGCAACGAC
GTCAACGTCG GCGAACTGGA CATCGCCTCG GGCACCTACG GACTGGCGAC GTACCACAAC
ATCGGGCTCG TCACGCTCGC GATCGCGGCG GCGGGGATGC TGCTGTACCT GATCAGACAC
CGGATCCGAC GCGAATGGGT GGCGATCTAC CCGCTGCTGG GCTTCGGACC GTTCTTCGTG
GTCGCGCTGT ACACCGGGCA GCGACCGCTC AACGTCGTGG AGTACACCGG CCACTTCTAC
AACGTCCGCT TCGGACTCAT CATGCTGCTG CCCGCCGCCG TCTTCGCCGG TTACCTGATC
TCCGTGGGCG TCAGCGCGGT ACGGCGGTTC CGGTTCCGGT CACTGCGGTG GACGGCGGTG
GCCGTCGCCG TCCTGGTAAC CGCGTTCGGC ATCGCCACCG TGCCGGGGGT GGTCACCCTT
CGCGACGCGG TGGAGTTCCG CGAGACCGAC TGGGAGAACG CCGCGACCGC GCGCTGGCTG
CGCGACAACC ACGACGGCGA CCTGACCCTC ATGATGTCGT TCGCGAACGA GTCGGTCACC
TACGACTCCC GCATCCCCAC CGAATCGCTG GTCTACGAGG GAACCTACCG GCTGTGGGAA
CGCGCACTGG CCGACCCGCA CGCGCAGGGC ATCGAATGGA TCTACATGAG AGCCCTTCCC
GACGCCGAGG ACCTGGTGTG GCACGCGCTG TACGACACGC CGCAACTGGA GGACCACTAC
ACGCTGGTCT ACGAGTTCCA CGACCGGCTG GTGTTCCGGT CCACAAAGGC ACTGACCGAG
GAGTCGGAGG GCGAGGATGA CTGA
 
Protein sequence
MTTELHHPTR QRTAVAAAID GVAVVMPAYG EEANLAATVT DFLDTLDDAR IPHAVVVVND 
GSTDRTGDVL DSLARRYPGR VVAVHHDRNR GYGAAVRSGL DAALRHTELG QILLTDSDRQ
FHAADLLELR RRKATERADA ILGYRERRAD PWHRRLNARV WTLLCKTLLR LPGRDVDCAY
KLIDRRLLEE LSLTGEAAAI SPELVSRITV DGNRVVEHPV RHYPRTHGEQ TGARLSVVAR
SLLSLAGVYA RLVRDAHRLV WLRRLARPAN PTAAVVTILA ALLAAAAYVF YLDQGVLLAY
KDANSHLLIA RRVVASPTAG LAQLGGVWLP LPHLLSAPLA AWESWYLNGF AGALISMISF
VLCVRYLYLL GETLTRTRLG GLVTAALFAG NANILYLQAT AMTELPLFAC LAAATYHLDQ
WCRAARSRDL AAASVAVLAA TLTRYEGWVF CAAALVIVIY VELRRHRSYS RTESSLVIFG
FLACAGIAAW LVWNLVIFAD PLYWQRGEYA ESSLWVDGND VNVGELDIAS GTYGLATYHN
IGLVTLAIAA AGMLLYLIRH RIRREWVAIY PLLGFGPFFV VALYTGQRPL NVVEYTGHFY
NVRFGLIMLL PAAVFAGYLI SVGVSAVRRF RFRSLRWTAV AVAVLVTAFG IATVPGVVTL
RDAVEFRETD WENAATARWL RDNHDGDLTL MMSFANESVT YDSRIPTESL VYEGTYRLWE
RALADPHAQG IEWIYMRALP DAEDLVWHAL YDTPQLEDHY TLVYEFHDRL VFRSTKALTE
ESEGEDD