Gene Snas_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_0844 
Symbol 
ID8882028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp892610 
End bp894733 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content71% 
IMG OID 
Productthymidylate kinase 
Protein accessionYP_003509649 
Protein GI291298371 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGGCG TTTCGGATCT CAAAGCGGTC CTGCGCATCC GTCCGTTCCG GCGCCTGTGG 
CTGGTGCTGG GACTGTCGGC GACCGGGGAC TGGCTGGGTC TGCTGGCCAT GTCGCTGTTC
GCCGCGTCCC AGTTCGACAA CACCACGGCG CAGGGCGCCG CGTTCAGCCT CGTCATCGTG
GTGCGGCTGA TCCCCTCGCT GCTGCTGGGC CCGCTGGCCG GGGTGTTCGC CGACCGCTGG
GACCGCCGGA TCACCATGGC CGTGTGCGAC ACGCTGCGGT TCGCGCTGTT CGCCTCGGTT
CCGCTGGTGG CGATGTGGAC CGGCGTCGGT GTCAAGGCCG CCGGCTGGAC CGCGATCGCG
ACCTTCCTCA TCGAGGCCGT CGGCATGATG TGGATGCCCG CCAAGGAGGC CGCCGTCCCC
AACCTGCTGC CCCGCAGCCG GCTGGAGGCC GCCAACCAGC TGACGCTCGT GACCACCTAC
GGTTTCGCGC CCGTGATCGC CGCCGGGGTC ATGTCGGTGC TGGAGGCCAA CTGGATGGCC
GGGGCGCTGG GCGAGGTCGG CGACTGGGCG GCCCCGGCCG CGATCGGGCT GTACCTCAAC
GCGCTGACCT TCCTGGCCGC CGCCGCGGTG GTGTTCTTCC GGATCCCCGA GATCAGCGTG
CGCCGCAACG AACCCGGGCA GGCGCCGAAG GAACAGCGCG GCATGCTGCG CGACTTCGCC
GACGGCTGGC GCCACATGGG ACGCAACAAG ATGGTGCGCG GCCTGGTGCT GGGCATCCTG
GGCGCCTTCG CCGGAGCCGG GGTCATCATC GGCACCGGCC AGTTCTTCGC GCGCTCGCTG
GGCGGCGGCG AGGCCGCCTT CACCGTCCTG TTCGCCACCG TCTTCGTCGG TCTAGGCCTG
GGCATCGTCG CGGGTCCGGC GCTGGTGGGA CAGCTGTCGC GACGCCGCTG GTTCGGCATG
AGCATCGTGG TGGGCGGCTT CGGGCTGCTG GTCGACGGCC TGGCGCCGCA CCTGTGGGTG
GCGATCGTCG GGACGCTCAT CGTCGGCGCG GGCGCCGGAA TGGCGTTCCT GTCCGGGATC
ACGCTGCTGG GCCGCGAGGT CGAGGACACG GTGCGCGGCC GGATGTTCGC GTTCATCTCC
ACCAGCGCCC GGGTGGTCCT GATGGTCACG ATCTCGGCCG CCTCGGTCAT CGGCGGCTAC
GGCTCCGCCC GCCAGGTCGA CATCGGCCCG CTGACCTTCG ACTTCTCCTT CGGACGCATC
CTGCTGCTGA TCGCCGCCGT CGTCGGTGTG CTGACCGGTT ACATTGCTTT CCGGCAGATG
GACGACAAAC CGGGGGTGCC GGTGATCAAG GACCTGTGGG GTTCGATGCG GGGACGGCCG
CTGGTGCACG AGTCACCGGC GGGCGGGACC TTCGTGGTGT TCGAGGGCGG CGAGGGCAGC
GGCAAGTCCA CACAGGCCGT CAAGCTGGCG GCCTGGCTGC GGCAGCGCGG CCACGAGGTG
GTGCTGACCC GCGAGCCCGG CGCCACCGAA CTGGGCGTGC GCATCCGCAC CCTGCTGCTG
GACCCGGAGT CGGGGACCTC GCCCAGTCCG CGCACCGAGG CGCTGCTGTA CGCCGCCGAC
CGCGCCCAGC ACGTGTCCAA AGTGGTACGC CCGGCGCTGG ACCGGGGCGC GGTGGTCATC
TCCGACCGCT ACGTCGACTC CTCGCTGGCC TACCAGGGTT CCGGACGCGA ACTGCCCGCC
GACGAGGTCG CCTGGCTGTC GCACTGGGCC ACCGGCGGAC TCAAGCCGGA CCTGGTGGTG
CTGCTGGACA TCGACCCGCG CGTCGGCCTG GTGCGCGCCA CCAAGGGCTC GGCCGGGGAC
CGGCTGGAGC AGGAGGCGCT GACCTTCCAC GAGGCGGTGC GGGAGAAGTT CCGCGACCTG
GCCGCCGACG ACTCGTCGCG CTACCTGGTC GTGGACGCGA CCCAGTCCCC CGAGGACATC
GCGGCCAAGG TCTCCGAACG GGTGGCCGCC GTGGTGCCGC CGGTGCCGGG CGAGACCGCC
GATGACGACG ATCCCAACCC GCCGATGGAC GCGCACGCCG AGGACAAGAC CGTGAAGTTC
GCCCCCGGAA AGGCGACGCT GTGA
 
Protein sequence
MSGVSDLKAV LRIRPFRRLW LVLGLSATGD WLGLLAMSLF AASQFDNTTA QGAAFSLVIV 
VRLIPSLLLG PLAGVFADRW DRRITMAVCD TLRFALFASV PLVAMWTGVG VKAAGWTAIA
TFLIEAVGMM WMPAKEAAVP NLLPRSRLEA ANQLTLVTTY GFAPVIAAGV MSVLEANWMA
GALGEVGDWA APAAIGLYLN ALTFLAAAAV VFFRIPEISV RRNEPGQAPK EQRGMLRDFA
DGWRHMGRNK MVRGLVLGIL GAFAGAGVII GTGQFFARSL GGGEAAFTVL FATVFVGLGL
GIVAGPALVG QLSRRRWFGM SIVVGGFGLL VDGLAPHLWV AIVGTLIVGA GAGMAFLSGI
TLLGREVEDT VRGRMFAFIS TSARVVLMVT ISAASVIGGY GSARQVDIGP LTFDFSFGRI
LLLIAAVVGV LTGYIAFRQM DDKPGVPVIK DLWGSMRGRP LVHESPAGGT FVVFEGGEGS
GKSTQAVKLA AWLRQRGHEV VLTREPGATE LGVRIRTLLL DPESGTSPSP RTEALLYAAD
RAQHVSKVVR PALDRGAVVI SDRYVDSSLA YQGSGRELPA DEVAWLSHWA TGGLKPDLVV
LLDIDPRVGL VRATKGSAGD RLEQEALTFH EAVREKFRDL AADDSSRYLV VDATQSPEDI
AAKVSERVAA VVPPVPGETA DDDDPNPPMD AHAEDKTVKF APGKATL