Gene Snas_1015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1015 
Symbol 
ID8882200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1076671 
End bp1078101 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content74% 
IMG OID 
Productcarbohydrate kinase YjeF related protein 
Protein accessionYP_003509818 
Protein GI291298540 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00295738 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCGGAG TGTGGAAGGT CGACCAGGTC CGAACGGCCG AGGCCGAGCT GATGAAGCGA 
CTGCCCGAGG GCGCGCTGAT GAAGCGCGCG GCGGCGGGCC TGGCGGCGCG GTGCGCGCGG
CTGCTGCACG GCATCGGCCT GTACGGCTCG GCCGTGACGG TGCTGGCCGG CAGCGGCGAC
AACGGCGGCG ACGCCCTGTA CGCGGGCGCG CTGTTGGCCC GGCGCGGCGC GGCGGTCACC
GCCGTGGAGA TCTTCCCGGG CCGCACCCAC GCCGCCGCGC TGGCGGAGTT CCGTGCGGCT
GGCGGCCGGG TCACCGACAC CACTCCCGAG CACCACGATC TCGTCATCGA CGGGATCCTG
GGCATCGGCG GCCGTCCCGG GCTGCCCGAC AACGCCGCGA TCATGCTGTC GCGGATGGGC
CGGGTGCTGA CCGTCGCCGT CGACGTCCCC AGTGGCGTCG ACGTGGACAC CGGCGCCGCG
GTCGCGCAGG CGGTGCGCGC CGACGTCACC GTCACCTTCG GCTGCCTCAA ACCCGCGCTC
GCGGTGGGTG CGGCGGCGGC GCTGGCCGGG ATCGTCGACT GTGTCGACAT CGGACTGGGT
CCGTTCCTGC CCGAGCCCTA CGCCAAGGTC CCCGAACTGT CCGACGTGCG GTCGTGGTGG
CCGCACGCCC GCCCCCACGA CGACAAGTAC ACCCGCGGCG TGCTGGGAGT GTGCGCCGGT
TCGTTCCGCT ACCCCGGCGC CGGGGAACTG GCCACCGCCG GGGCACTGGC GGGCCCGGCC
GGTTACATCC GTTACGCGGG AACGGCTTCC CGCCACATCC GCTACAGCTA CCCCGAAGTG
GTCACCAAGG ACCGGGTCGC CGACGCCGGA CGGGTGCAGG CCTGGACGGT GGGGCCCGGC
ATGGGCACCG ACTCGCAGGC CGCCTCCCAG CTGGCCTCGG CCATGGCCGC CCCGGTTCCC
ATGTGCATCG ACGCCGACGC GCTCACCCTG ATCTCCGACG AACCCGAGGC ACTGTACGAG
CGGCAGTCGC CCAGCGTCAT CACCCCGCAC GACCGCGAGT TCTCCCGCCT GAGTGGACGG
ACCCCGGGCG ACGACCGGGC CGCCGACGCC CTCGACCTGG CGCAGCGGCT GGACTGCATC
GTGCTGCTGA AGGGCTACCG CACCATCATC GCCAACTCCA ACGGCGACCT GTACTTCAAC
CCGACTGGCG ACCCGAGCCT GGCCACCGCC GGTTCCGGGG ACGTGCTGGC GGGACTGCTC
GGCGCGATGC TGGCCGCCGG GGTGCCGCCG GAGCGCGCGG CGATGTCGGC GGCGTTCCTG
CACGGCCTGG CCGGACGCGA GGCCGCCAAC CACGGCCCCG TCACCGCCTC GGGCATAGCC
AAGGCCCTGC CGACAGCGAT CAGCAGGGCC CTGGGTACAG CCGTCACATA G
 
Protein sequence
MRGVWKVDQV RTAEAELMKR LPEGALMKRA AAGLAARCAR LLHGIGLYGS AVTVLAGSGD 
NGGDALYAGA LLARRGAAVT AVEIFPGRTH AAALAEFRAA GGRVTDTTPE HHDLVIDGIL
GIGGRPGLPD NAAIMLSRMG RVLTVAVDVP SGVDVDTGAA VAQAVRADVT VTFGCLKPAL
AVGAAAALAG IVDCVDIGLG PFLPEPYAKV PELSDVRSWW PHARPHDDKY TRGVLGVCAG
SFRYPGAGEL ATAGALAGPA GYIRYAGTAS RHIRYSYPEV VTKDRVADAG RVQAWTVGPG
MGTDSQAASQ LASAMAAPVP MCIDADALTL ISDEPEALYE RQSPSVITPH DREFSRLSGR
TPGDDRAADA LDLAQRLDCI VLLKGYRTII ANSNGDLYFN PTGDPSLATA GSGDVLAGLL
GAMLAAGVPP ERAAMSAAFL HGLAGREAAN HGPVTASGIA KALPTAISRA LGTAVT