Gene Snas_0072 details

Gene Information

Locus tag: Snas_0072
Symbol:
ID: 8881248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 80612
End bp: 82501
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003508887
Protein GI: 291297609
COG category:
COG ID:
TIGRFAM ID:
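
The coordinate and length fields above are related by simple arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon (both assumptions are consistent with the values in this record but are not stated by it); the function names are illustrative only:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(80612, 82501)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1890 bp) and Protein Length (629 aa) fields.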


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.504192
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAAGC CCACTTTCCG CCAGCGTGCG CGCTATTGGT TCGACAACAC CATGTCCAAG 
GGCACCAAAG CCCTGATCAG CTGGCTGACC ATCATCACCC TGGTCGTCGT GGCCATCGGT
GCCGGTCTCG CCGTGCTGGC CTCGCTGATC GACCCCAAGG CCGAAGACGA AGGGTTCGCC
GCCAACCTGT GGACGGCGTT CATCCACGTC ATCGACCCGG GAACCATCAC CGGTGACACC
TCTACTCCGC TGTTCATCGG CATGATGCTG GTGATCACCA TCGGCGGTCT GGTCATCATC
TCGTCCCTTG TGGGTATTCT GACCACCGGT CTGGACGCCA AGCTGGAGGA ACTGCGCAAG
GGCCGCTCAC TGGTCGTCGA GAGCGGCCAC ACCGTCGTCC TGGGCTGGTC GGACCAGGTC
TTCACCGTCA TCTCCGAACT GGTGGAGGCC AACGAGAGCG AGAAACGCGC CTGCATCGCC
ATCCTGGCCG ACCGCGACAA GGTCGAGATG GAGGACGAGA TCCGGGCCAA ACTCTCCGAC
CTGAAGACCA CGAAGGTCGT GTGCCGCACC GGCGACCCGG CCGACCCCGA CGACATCGCC
ATCGTCAACC CCGAGCAGGC CAAGGGCATC GTCCTGCTCA CCTCCAACGA GGAGGACCCG
GACGCCCAGC TGGTGCGCAG CCTGCTCGCC GTCACCGAGG GCGGGCAGAA GACCGACGGA
CCGCACGTGG TGGGAGCGGT CACCGACAGT CGCAACCTGC CCGCGGCCCG GCTGGCCGGT
GGGCCCCGCG CCCAGGTCGT CGACGGCGAC GACATCATGG CGCGGCTGAT GGTGCAGACC
TGTCGGCAGT CGGGACTGTC GGTCGTCTAC ACCGACCTGC TGGACTTCGG CGGCGACGAG
ATGTACATGG TCGAGGAGCC GCGGCTGGTG GGCTGCACGG TGCAGCAGGT GGTGCACGCG
TACCGCGTCT CAAGCTTCAT GGGCATCTAC AACCCCAACA CCGGCAGCCG CATCAACCCG
CCGTCCTCGA CCGTCGTCAA CCCGGGCGAC CGGCTCATCA TGCTGTCCGA GGACGACAGC
ACCATCGTGC TGGACGGCGC GCAGCCGTAC ATCGAGGAGA AGGCCATCGT GGCGCGCGGC
GAGCACGGCT CCCGTCCCGA ACGCACCCTC ATCCTCGGCT GGAACGCCCG CACCCCAACG
GTTCTGGAAC AGCTCGACGC CTACGTGTCC CGAGGCTCCA CCACCGACGT CGTCTCCGAC
CACGGCGACA TGTCCACCCA GCTGCGTCGC CTCGGGCCGC AGATGAAGGT GCAGTCGGTG
AACTTCAAGG AGGACGACAC CACCAGCCGC GCGCTGCTGG AGTCGCTCAA CGTCGCCAGC
TACGACCACG TCATCGTGTT GTGCCGCGAC GACGTACCGG CGCAGTTGGC CGACTCCAAG
ACCCTCGTGA CGCTGCTTCA CCTGCGCGAC ATGGCCGAGA AGTCCGGCCA GCGCTACAAG
GTGGTCAGCG AGATGGCCGA CGACCGCAAC CGGGGCCTGG CCCAGGTGAC CCAGGCCGAC
GACTTCATCG TCAGCGAGAA GCTGATCAGC CTGATGCTGA CCCAGACCGC CGAGAACCCG
CACCTGTCGC AGGTCTTCAA CGACCTGTTC GACCCGGACG GCAGCGAGAT CTACCTGAAG
CCGTGCGAGT ACTACGTCCG GCCGGGCATG CCGCTCAACT TCTACACGGT GGCCGAGAGC
GCCAGGCGTC GCGGCGAGAC GGCCATCGGC TACCGGCAGG CGGCACTGTC CAGCCAGGCG
CCCACCTTCG GTGTCGTCCT CAACCCGGAC AAGGCGGCCG GTTTCACGAT GCAGGCCGGC
GACAAGGTGA TCGTGCTGGC CGAGGACTGA
 
Protein sequence
MSKPTFRQRA RYWFDNTMSK GTKALISWLT IITLVVVAIG AGLAVLASLI DPKAEDEGFA 
ANLWTAFIHV IDPGTITGDT STPLFIGMML VITIGGLVII SSLVGILTTG LDAKLEELRK
GRSLVVESGH TVVLGWSDQV FTVISELVEA NESEKRACIA ILADRDKVEM EDEIRAKLSD
LKTTKVVCRT GDPADPDDIA IVNPEQAKGI VLLTSNEEDP DAQLVRSLLA VTEGGQKTDG
PHVVGAVTDS RNLPAARLAG GPRAQVVDGD DIMARLMVQT CRQSGLSVVY TDLLDFGGDE
MYMVEEPRLV GCTVQQVVHA YRVSSFMGIY NPNTGSRINP PSSTVVNPGD RLIMLSEDDS
TIVLDGAQPY IEEKAIVARG EHGSRPERTL ILGWNARTPT VLEQLDAYVS RGSTTDVVSD
HGDMSTQLRR LGPQMKVQSV NFKEDDTTSR ALLESLNVAS YDHVIVLCRD DVPAQLADSK
TLVTLLHLRD MAEKSGQRYK VVSEMADDRN RGLAQVTQAD DFIVSEKLIS LMLTQTAENP
HLSQVFNDLF DPDGSEIYLK PCEYYVRPGM PLNFYTVAES ARRRGETAIG YRQAALSSQA
PTFGVVLNPD KAAGFTMQAG DKVIVLAED
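
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. A minimal sketch, assuming the space-separated 10-base blocks are display formatting only, and using the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. The helper names are illustrative, and only the first and last lines of the gene sequence are used here:

```python
from itertools import product

# Codon table in NCBI ordering (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last lines of the gene sequence listed in this record.
first_line = "ATGTCAAAGC CCACTTTCCG CCAGCGTGCG CGCTATTGGT TCGACAACAC CATGTCCAAG"
last_line = "GACAAGGTGA TCGTGCTGGC CGAGGACTGA"

print(translate(first_line))
print(translate(last_line))
```

The two translated fragments correspond to the first 20 and last 9 residues of the protein sequence above, with the final TGA codon reported as `*`. The last line can be translated on its own because the 1860 bases before it are a multiple of 3, so it starts in frame.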