Gene Snas_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1036 
Symbol 
ID8882221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1096548 
End bp1097567 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content66% 
IMG OID 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_003509839 
Protein GI291298561 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.179609 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCCCC ACAAGATCAC CAGAATCGTC GCTCTTCCCG CCGCGGTGGC CGTGCTGGCG 
CTCGGCCTGG CGGCCTGTGG TTCCCCGGGT TCCTCGGGCA CCAAGGACGC CGACAAGGTG
TCCGGCAAGG GCTGTGAACC CGTCGCCGGT GACAGCCTGG TCGCCCTCGA AGACGACAAG
AAACTGCAGG CCTCGGAGAA CATCCTGGCC GCCTTCAACG ACAAGGCCGC CGACGAGCAG
GCCATCGCCG CCGTCAACGC GGTCTCGGCG AAGCTGACCA CCGACGACCT CATCGAGCTG
AACAAGTCCG TCGACGTCGA CCGCAAGTCC GCGGCCAAGA CCGCCAAGGC CTTCGCCGAG
AAGAACAAGC TCACCAAGGG CCTCGACAAG GGCTCGGGCT CGCTGGTGGT CGGTCACGCC
AACTACACCG AGTCCGAGAT CGTCGCCAAC CTGTACGCGA TCGCGCTGGA GGGCGCCGGG
TACACCACCG AGCTGAAGGA CGTCGGCAAC CGCGAGACCT ACCTGCCCGC GCTGGAGGAC
AACGACTTCC AGGTCATCCC GGAGTACGCC GCCTCGCTGA CCGAGACGCT CAACCCGGAC
CCCGACGCCG ACTCGGCCAA GAACCCGATC GCCGACAACG ACATCGACAA GACCCTGGAC
ACGCTGAAGC CCTTCGCCAA GGACGCCGGA CTGGCGCTGA GCGAACCCGC CGAGGCGGCC
TCGCAGAACG CCTACGTCGT GACCAAGGCC TTCGCCAAGG AGCACGGCGT CAAGACGCTG
TCGGACTTCG CCGACAAGTG CAGCGGCAAG GCCTCGTCGC TGGCCGGTCC GCCCGAGTGC
CCCAAGCGCC TCTACTGCGA GGTGGGTCTC AAGGAGACCT ACGGAATCCA GTTCGGAACC
TTCAACTCAC TGGACCTCGG CACCGGCACC AAGCAGTCGG TCGCCAGTGG AGACTCCACC
GTCGGTACTG TGACCACAAC GGACAGTGCA CTCGCGGATG GTGTGACCGT CAAGGGCTAG
 
Protein sequence
MSPHKITRIV ALPAAVAVLA LGLAACGSPG SSGTKDADKV SGKGCEPVAG DSLVALEDDK 
KLQASENILA AFNDKAADEQ AIAAVNAVSA KLTTDDLIEL NKSVDVDRKS AAKTAKAFAE
KNKLTKGLDK GSGSLVVGHA NYTESEIVAN LYAIALEGAG YTTELKDVGN RETYLPALED
NDFQVIPEYA ASLTETLNPD PDADSAKNPI ADNDIDKTLD TLKPFAKDAG LALSEPAEAA
SQNAYVVTKA FAKEHGVKTL SDFADKCSGK ASSLAGPPEC PKRLYCEVGL KETYGIQFGT
FNSLDLGTGT KQSVASGDST VGTVTTTDSA LADGVTVKG