Gene Snas_2668 details

Gene Information

Locus tag: Snas_2668
Symbol:
ID: 8883863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 2807353
End bp: 2809704
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: RNA binding S1 domain-containing protein
Protein accession: YP_003511438
Protein GI: 291300160
COG category:
COG ID:
TIGRFAM ID:
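
The coordinates and lengths in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 2807353
END_BP = 2809704
GENE_LENGTH_BP = 2352
PROTEIN_LENGTH_AA = 783


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2352
print(expected_protein_length(GENE_LENGTH_BP))  # 783
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.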


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.427217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.51511
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTCG ACATCAATGA GGCCATCGCC ACCGAGCTGG GCGTGGGTGA GCGGCAGGTG 
GCCGCCGCCG TGGACCTGCT CGACTCCGGA GCGACCGTGC CGTTCATCGC CCGGTACCGC
AAGGAGGCCA CCGGGACCCT GGACGACGCG CAGTTGCGCA CGCTCGAGGA GCGGTTGCGG
TACCTGCGGG AGTTGCACGA GCGGCGTGCG TCCATTCTGG ATGAGATTGA CAAGCAGGGC
AAGCTGACCG ACGAGTTGCG GGGGCAGATC ATGTCCGCCG AGACCAAGGC GCGGTTGGAG
GACCTGTATC TGCCGTACAA GCCGAAGCGG CGGACCAAGG CCCAGATCGC GCGGGAGGCC
GGGCTGGAGC CGTTGGCGGA GCGGTTGTTG GGCGACGCGA GTCTGGATCC GGCGGCGACC
GCTGCCGAGT TCGTGGACGC CGACAAGGGG GTCGCCGACG CTGAGGCGGC GCTGGCCGGG
GCGCGGGCGA TCCTGACCGA GCGGTTCGCC GAGGACGCGG ACCTGATCGG GCAGTTGCGG
GAGCGGATGT GGAAGCAGGG GCACATCACC TCGAAGGTGC GCGAGGGCAA GGAGACCGAG
GGCGCCAAGT ACTCCGACTA CTTCGAGTTC GCCGAGCCGT TCGGGAAGCT GCCCTCGCAC
CGGGTGCTGG CGCTGTTCCG GGGTGAGAAG GAGGAGGTCC TGACGCTGTC GCTGGAGGCC
TCGGCCGAGG ACGACGCCGC GAGCGTGGGG CCCAGTGACT ATGAGCGTGA GATCGCGGCG
CGGTTCGGGA TCGCCGATCG AGGGCGGCCT GCCGACAGGT GGCTGCTGGA CTCGGTGCGG
TGGGCCTGGC GGACTCGGGT GAGCGTCGGC TTGGCCGTCG ACACGCGGGT GCGGTTGTGG
CAGGCGGCCG AGGACGAGGC GGTGAGCGTG TTCGCCGCGA ACCTGCGGGA TCTGCTGCTG
GCTGCTCCCG CGGGGACCCG GACAACGCTG GGGCTGGACC CGGGGTTCCG TACCGGGGTG
AAGGTCGCCG TCGTGGATGC GACCGGCAAG GTCGTCGACA CCGCGACGAT CTATCCGCAT
CAGCCGCAGA ATCGGTGGGA CGAGGCGCTG GCGACGTTGG CGGCGCTGGT GAAACGGCAC
AATGTGGAGC TCATCGCGAT CGGCAACGGG ACCGCGTCGC GGGAGACCGA CAAGCTCGCC
GGTGACCTGG TGAAGTTGGC CAAGGGGCAT CAGCTGACCA AGGTGATGGT GTCCGAGGCG
GGGGCGTCGG TGTATTCGGC CTCGGCCTAC GCTTCGAGGG AACTGCCCGA ACTGGACGTT
TCGATCCGGG GCGCGGTGTC GATCGCGCGG CGGTTGCAGG ATCCGTTGGC GGAGCTGGTG
AAGATCGATC CGAAGTCCAT CGGGGTGGGT CAGTATCAGC ACGATCTGGC CGAGCACAAG
CTGTCGCACA GTCTGGACGC CGTCGTGGAG GACTGCGTCA ACGGCGTTGG CGTGGACCTC
AACACGGCTT CGGCGCCACT GTTGGCGCGG GTGTCGGGTA TCAGTTCGGC GCTGGCCGAC
AACATCGTGG CGCACCGCGA CACGGCGGGC GCGTTCACGT CGCGCAAGGG GTTGCAGGAT
GTGGCTCGGT TGGGGCCCAA GGCTTTCGAG CAGTGCGCGG GCTTCCTGCG GATCCGGGAC
GGGGTTGACC CGTTGGACTC CTCGGCGGTG CACCCGGAGG CGTATCCGGT GGTGCGGCGG
ATCGCGCAGG CCACCGGCAG CGACGTCAGC GGGCTGATCG GCAACCGGTC GGTGCTGGGG
TCGGTGAAGC CGCAGGACTT CGTGGACGAG ACCTTCGGTC TGCCGACCGT GACCGACATC
CTGTCCGAAC TGGACAAGCC GGGGCGGGAC CCGCGGCCGG AGTTCAAGAC CGCGGTGTTC
GCCGAGGGCG TGGAGAAGCT GTCCGACCTG GCGCCGGGGA TGGTGCTGGA GGGCACGGTC
ACGAACGTGG CGGCCTTCGG GGCCTTCGTC GACGTGGGGG TGCATCAGGA TGGTCTGGTG
CACATCTCGG CGATGTCGAA CGACTATGTG GCCGATCCCC GGGATGTGGC CAAGCCGGGT
GACATTGTGA AGGTGCGGGT GCTGGAGGTC GACGAGGCGC GCAAGCGGAT CTCGCTGACG
ATGCGGTTGC AGGACAAGGC CGAGGCGAAG CCGCCGAAGC AGTCCGACGG CAACAAGCGC
GGCAAGGGCG ATCGCAAGGG CGGCAAGGAC AACCGTAAGG GCGGCGGCAA GCCGCGCGGG
CGTGACGGCG GCAAGCAGTC CGAGCCGCAG GGCGCGATGG CCGAGGCACT GCGGCGAGCG
GGCTTGGCGT AG
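
The GC content field of the record can in principle be recomputed from the gene sequence above. The sketch below assumes GC content means the fraction of G and C bases over all bases, with the spaces and line breaks of the display format ignored; `gc_fraction` is an illustrative name, and only the first display line is used here, so the printed value need not equal the 69% reported for the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases in a sequence; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First display line of the gene sequence, copied from the record.
first_line = "ATGACCGTCG ACATCAATGA GGCCATCGCC ACCGAGCTGG GCGTGGGTGA GCGGCAGGTG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence as one string (line breaks included) would give the value to compare against the GC content field.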
 
Protein sequence
MTVDINEAIA TELGVGERQV AAAVDLLDSG ATVPFIARYR KEATGTLDDA QLRTLEERLR 
YLRELHERRA SILDEIDKQG KLTDELRGQI MSAETKARLE DLYLPYKPKR RTKAQIAREA
GLEPLAERLL GDASLDPAAT AAEFVDADKG VADAEAALAG ARAILTERFA EDADLIGQLR
ERMWKQGHIT SKVREGKETE GAKYSDYFEF AEPFGKLPSH RVLALFRGEK EEVLTLSLEA
SAEDDAASVG PSDYEREIAA RFGIADRGRP ADRWLLDSVR WAWRTRVSVG LAVDTRVRLW
QAAEDEAVSV FAANLRDLLL AAPAGTRTTL GLDPGFRTGV KVAVVDATGK VVDTATIYPH
QPQNRWDEAL ATLAALVKRH NVELIAIGNG TASRETDKLA GDLVKLAKGH QLTKVMVSEA
GASVYSASAY ASRELPELDV SIRGAVSIAR RLQDPLAELV KIDPKSIGVG QYQHDLAEHK
LSHSLDAVVE DCVNGVGVDL NTASAPLLAR VSGISSALAD NIVAHRDTAG AFTSRKGLQD
VARLGPKAFE QCAGFLRIRD GVDPLDSSAV HPEAYPVVRR IAQATGSDVS GLIGNRSVLG
SVKPQDFVDE TFGLPTVTDI LSELDKPGRD PRPEFKTAVF AEGVEKLSDL APGMVLEGTV
TNVAAFGAFV DVGVHQDGLV HISAMSNDYV ADPRDVAKPG DIVKVRVLEV DEARKRISLT
MRLQDKAEAK PPKQSDGNKR GKGDRKGGKD NRKGGGKPRG RDGGKQSEPQ GAMAEALRRA
GLA
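
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares with the standard code; table 11 additionally permits alternative start codons, which this sketch does not handle (this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored and one
    trailing stop codon, if present, is dropped."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First and last display lines of the gene sequence, copied from the record.
print(translate("ATGACCGTCG ACATCAATGA GGCCATCGCC ACCGAGCTGG GCGTGGGTGA GCGGCAGGTG"))
# MTVDINEAIATELGVGERQV
print(translate("GGCTTGGCGT AG"))
# GLA
```

Both results match the corresponding start and end of the protein sequence shown in the record.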