Gene Snas_1142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1142 
Symbol 
ID8882328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1220727 
End bp1222577 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content63% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003509945 
Protein GI291298667 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.163498 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.236826 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACT CAAGTTTCAC GGTGGCGTCG ACGAGCGTGT CCCGTCGCCG GCTGTTCCAG 
GCGGCGGGTC TCGGCGCCGC GGGTGCGGCC GGACTGGGAA CCATGACCGC CTGCAAGGCC
GACCCGGGTA TCCAGGGCAA AGGCGAGTTC CACGGCGGCT ACCCCTACGA GACCCCGCCG
GACGGCCACT TCAACACCGC CGGAGCCCCG TACGCGGTGG TGCCGCACGT GTTCGTCGAG
GGCATGTACC TGGACCTGAC CTGTATGCCC GGCGGTTACT ACTGGTGGGA CAAGCAGGAA
TGGGAGTACT TCCTGGCCGA GAGCTTCGAG CTCGACGACA AGGAGAACAC CTTCACCCTC
AAGGTGCGTG ACGGTCTGAA GTGGAGCGAC GGGGAGCCGC TGACCGCCAA GGACTTCGAG
ACCACCTACT GGTTGTGCTG GATCCGCAGC AACCCGATGT GGAAGTCGCT CGACAGCCTC
AAGGCCACCG ACGACATGAC CATCGAGGGC AAGCTGAGCA ACCCGTCCTC GGTCATCGAG
CGCTACATGC TCAAGACCAA CGTGGCGCCC AGCCACAACA AGGACTCCAA GATGGGCAAG
ACCTACCGGG ACTTCGCCGA GGCGGCGATG AAACTGCACG AGGACGGCAA GGACCAGACC
TCCAAGGAGG GCGAGAAGCT CGGCGCCGAC CTGGCCAAGT TCCGGCCGGA GAACCTGCTG
ACCTCGGGTC CGTTCAAACT GGAGAAGAAG GACTTCACCA AGACCCAGAT GGTGTTGACC
AAGAACAAGA ACGGCTTCAA CGCCGACAAG GTCAAGTTCG ACAAGGTCGT CGTCTACGAG
GGCGAACTGC CGCAGATCAC GCCGCTGGTC AAGGACAAGT CCGTCGACTA CGCCTCCCAC
GGTTTCGCGC CCAACCAGGA GAAGAAGTTC AAGCGCGACG GTCACAAGAT CGTCCGGCCG
CCGGTGTACT CCGGGCCGTC GCTGTACATC AACTTCAAGG AGGTGCCCGA GTTCAAGGAC
GTGCTGACCC GGCGCGCCAT CGCGCACGCC ATCAACCGCG CCGACGCGGG CAGTATCGCC
CTGGGCGACT CCGGTCCGGC CGTGAAGTAC ATGGCGGGCT TCTCCGACAT CATGGTCCCC
GACTGGATCT CCAAGGAGGA CCAGGACGCC TTCGACACCT ACGAGCACGA CCTCGACAAG
GCCGCCGAGC TCATGGAGAA GGCGGGCTGG AAGAAGGACG GTGACGTGTG GGCCAGGGGC
GACAAGAAGA TGGACTACGA GATCAAGTGG CCCTCCACCT ACGCCGACTG GTCGGCCTGC
GGTGACGCCA TCGTGGACCA GCTGACCGAC TTCGGCATCA AGCTGACCGC GCAGCCCGTC
GACGAGGAGC AGTACCTCGA GGAGATCGAC AAGGGCGAGT TCCAGATGGC CATCAACGTC
TGGGGCTCCT CGCAGCACCC GCACCCGCAC TTCGCGTTCG TCGCCGACCT GTTCACCCAC
AACACCCCGA TCGCCAAGAA CAACGGCGGC GACGGCATCG CCTTCGACCT GAAGGTGAAG
TCCAAGAAGC ACGGCGAGGT GGACCTGGAG GAACTGGTCC TCAAGGCCGG GCAGGGACTG
GACGAGAAGG AGCAGAAGGC CAACGTCACC AAGGTGGCGC AGGTGTTCAA CGAACTGCTG
CCGATCATCC CGATCTGCGA GCGGTACTCC AACAGCCCGA TCCTGGAGGG CGAGGGCAAC
CGGGTCAAGG ACTTCCCCGA CGAGGACGAC CCGATCTACA AGAACTCGCC CTACGCCGAC
AACCCGATCA CCCTGGGAAT CGTGACCGGC AAGATCACTC CCAACGACTA A
 
Protein sequence
MTDSSFTVAS TSVSRRRLFQ AAGLGAAGAA GLGTMTACKA DPGIQGKGEF HGGYPYETPP 
DGHFNTAGAP YAVVPHVFVE GMYLDLTCMP GGYYWWDKQE WEYFLAESFE LDDKENTFTL
KVRDGLKWSD GEPLTAKDFE TTYWLCWIRS NPMWKSLDSL KATDDMTIEG KLSNPSSVIE
RYMLKTNVAP SHNKDSKMGK TYRDFAEAAM KLHEDGKDQT SKEGEKLGAD LAKFRPENLL
TSGPFKLEKK DFTKTQMVLT KNKNGFNADK VKFDKVVVYE GELPQITPLV KDKSVDYASH
GFAPNQEKKF KRDGHKIVRP PVYSGPSLYI NFKEVPEFKD VLTRRAIAHA INRADAGSIA
LGDSGPAVKY MAGFSDIMVP DWISKEDQDA FDTYEHDLDK AAELMEKAGW KKDGDVWARG
DKKMDYEIKW PSTYADWSAC GDAIVDQLTD FGIKLTAQPV DEEQYLEEID KGEFQMAINV
WGSSQHPHPH FAFVADLFTH NTPIAKNNGG DGIAFDLKVK SKKHGEVDLE ELVLKAGQGL
DEKEQKANVT KVAQVFNELL PIIPICERYS NSPILEGEGN RVKDFPDEDD PIYKNSPYAD
NPITLGIVTG KITPND