Gene Snas_4235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4235 
Symbol 
ID8885436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4527430 
End bp4530699 
Gene Length3270 bp 
Protein Length1089 aa 
Translation table11 
GC content68% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003512977 
Protein GI291301699 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGTTGT CCGCTGGGGC GGTGTCGGTG CGCGTCAAGC CCGACTTGAC GAAGTTCGCC 
GCCGAGTTGA AGGCGTTTCT TGTCGCGGCG TCGCGCGATG TCGTGCACAT CGGCGCCGAC
CTCGACGCCG GGAAGCTGGC CACCCAGGTC AAGGCCGCCG TGACCCGTGC CGGTGCGGGA
CGGGGCGTCG AGGTTCCGGT CACCGCCGAC ACCACGAAAC TCGCCGCCGC CGTGCGCTCG
GGCACCCCCA GCGGCAAGAC GAGCGTGGCC GTCGACGCCG ACACCTCGAC CGCGCTGTCG
CAAGTGGCCC GGTTGAAATC GCAATTGGAC GACCTGTCGC GCCGACACAT TGAACTGTCG
GCCTCAGGTG ATACCGCAGG CGCCGAGCAC CTGGTCGCCC AGATTCGCGA CGCCGCCGCA
GCCGCGCAAG AAATCGACCT GTCACACATC AAGGGCGCCG CGCTGGGCGT CGACACCAAG
GCCGCCCGCG CCGCCGCCGA GAAACTGCGT ACCGACCTAG CGCGCCTGTC GGGCGAGACC
TTCCGTATCG ACGTCACGGC CAACGCGGGC GATGTCGAGC GGGTGCGCGC CGAGCTTCAC
ACGCTGGCCG CCGATGCGTC CGATATCGAC ATTCCCGTCG CGGCCGACAC CTCGAAACTC
GCCGCACAAG TGCGCTCGGG TCTGGCCGCC GCCGACGGCG GACGGGTGCG CGTCCCCGTC
GCGGCCGATG CCGGGCAGCT TGCCGGGCAG GTGCGCCGCG CCGCCGCCCT GGCGCAACAA
GGCACCCGCA TCACGGTGCC CGTGGGCGCC AACACCAAGG GCATCGGCGG AAGCCTGGCC
GGGCTGTCCG GCATCGGTGG CGCACTGGCC GGCATCGGCA AGGTCGCCGC CATCGGCACC
AGCCTCGCGG CTGCGGCTGG TGGTGCGGCA CAACTGGCCG CAGCACTGGC CCCCGTGGCC
GGGGCGCTGG CCGCGTTGCC CGCGTTCGCC CTCGGCGCGG CGGGTGCCTT CGCCGTCCTC
AAGCTCGGAT TGTCCGGTGT GGGCGCCGCC TTGTCGGGTG ATTCGGCCGC ATTCGCCCAA
CTCGCCCCCT CAGCGCAAGC CGCCGTCACC GCGATACGCG GGCTGTCGCC GCAATTCGAG
CGACTCAAGA CCAGCATTCA GGGCAACCTG TTCGCCGGGC TCGACAAGCA GATAACCCGC
GTCGCCGACG TCCTGTTGCC CAAACTTCAA CAGCGGCTAC CGGCCATCGC CACCGCCTTC
AACGGCCTGG CCAAGGGCGT CGCCGGTGGC CTGTCGTCCG ACGGGTTCAC CTCCGGACTC
GACGCCGCGC TGTCCCACAC TGCGACCGGG ATAGCCGGGC TCGCCAAGGG AATGCGCCCC
CTGTTTTCCG GGCTCGGCCA GGTCATCGGC GCTTTCGCGC CGAGCCTGAC GGCTGCCGGA
CAAGCCGCCG GTGGACTGGC CGCGCGCTTC GGCGAGTTCA TCAGCAAGGC CGCGGAAACT
GGGCAACTGG CGTCGTTCGT TGACGGCGTT AAGACCGCAC TGTCTCAAGT GGGCGGCATC
TTGTCGAACC TCGGAAGTGT CGTCGCCTCG GTGTTCTCGG CCGCGTCGAG CTCGGGCGGC
GGCCTGCTGG CCACGCTGGA AACCATCACC GGCGCCGCCC GCGAGTTCTT CTCCAGTCTC
GAAGGCCAAG AGGCGCTGAG CGGGTTTTTC GGCGGCATTC AGTCGGTGGT GCGCGCCGTG
TTGCCGACCT TGCAGAACCT CGCGTCCGGC ATCGGCTCGG TGCTCGGCCC CGCCGTCGGC
CAGATAGCCA CGATTTTGGG CCCCGCCCTT GAGACCCTCT CCGGCTCCCT CGTCGATGGC
GTCGCAAAGC TTCTACCAGG GCTAATGCCC GTGGCCGAGG CGCTGGCTAG CATCGTCACC
GCAGCGGCGC CCCTGCTGGG CGTGGCCGGT CAACTGGGCG CGATTTTGGG CGGCATCCTG
GGCTCTGCTT TGTCCATGGT GGCCAGTCTG TTTCAGGCAC TGGTGCCGCC AGCCGTCCAA
ATTGCACAGA TCTTGCTGGC GAGCCTGATG CCTGCCTGGC GCTCGATTGA ATCGGCGCTA
CAAACTGTGA TCGCCGCCGT TTTGCCGGTG ATTCAGGCGT TCCTATCGGT CAATCAGGCG
CTCGCCCCGA TCCTCGGGCT CATCGTGCGC GTGGCCGCCG CGATCTTGTC GGGGCTGGTC
AAGGGGCTAA TCGCGCTGAT CACGCCGAGC CTGACGATCA CGAAGATTTT CGTCGGCGCT
CTGGCCAAGG GTATAGCCAC AGTGTACGGA TGGCTGAAGG AAAAGCTCGG ACCCGCCGCC
GCGTGGCTCG CCTCGGTCTG GAACGAAAAG GTGTCGCCCG CACTGTCGAA GGTGTCGGCG
TGGCTGTCCG AAAAGTTCAG TGCCGCCGCC GCGAGCGCCT GGTCGTGGAT CAAGGAAAAG
CTTGCGCCCC TTGGGCAAAA GCTCGCGACC ATATGGAGCG AAAAACTGGC GCCCGCGATC
CAAAAGGTCG GCCTGTGGTT TCGGGAGAAG TTCGTTCCCG CAGCTCAGCA GGTGTGGACA
TGGATACAGG AAAAAGTCAT CCCCGTTGTG TCCAAATTGG TGCGATGGTT CGTGTCGAAA
CTGGTTCCCG GAATCGAGCG CGTCGTCGAG TGGCTCGTCG ACCTCGCGGG CTGGTTTTTG
GACCTGGCGG TCGACATCGG TACCGCCGTC GGCAAGGCCA TCCGCTTTTG GGGCCGCTTC
ATCGCGTTCA TCAAGGCGCT ACCGGGCAAG GTGATCGGGT TCCTTAAGGG ACTCCCGGCG
AAGTTTGTAC AAATTGGAAA GAACATCGTC TCGGGCATCG TTCGCGGTAT CAAGCAGGCG
GCCGGGCGCA TCAAGGACGC CGCCGTCGGC GCCGCCAAAA GCGCCTATGA GGGAGCGAAG
GATTTCCTCG GCATCAACTC GCCGTCACGG CTCATGGCGT GGCTCGGCTC CCAAATGGGC
GCCGGTGTCG AGGTCGGCCT CGACGGTTCC GTCGGCGACG TGGTGTCGGC CTCGCGTGGC
CTGGCGCGCG CCGCCTACGA CCCCTGGCGC GGCTTCACCC CCCACCCCAA GTCCCGCACC
GACACCGACG GCCCCGGCGG CGTGAACGTC TCGGTTCGCA TCGGCGAGCG CGAGGTCGCC
GACATGGTCG TCGACGCCGT TCGCGCCCGC CCCGAGGCCG TCGCCTCAAC CGTCGCCCAC
GGTACCCGTC TGTCCAACCT AAGGAGATAG
 
Protein sequence
MVLSAGAVSV RVKPDLTKFA AELKAFLVAA SRDVVHIGAD LDAGKLATQV KAAVTRAGAG 
RGVEVPVTAD TTKLAAAVRS GTPSGKTSVA VDADTSTALS QVARLKSQLD DLSRRHIELS
ASGDTAGAEH LVAQIRDAAA AAQEIDLSHI KGAALGVDTK AARAAAEKLR TDLARLSGET
FRIDVTANAG DVERVRAELH TLAADASDID IPVAADTSKL AAQVRSGLAA ADGGRVRVPV
AADAGQLAGQ VRRAAALAQQ GTRITVPVGA NTKGIGGSLA GLSGIGGALA GIGKVAAIGT
SLAAAAGGAA QLAAALAPVA GALAALPAFA LGAAGAFAVL KLGLSGVGAA LSGDSAAFAQ
LAPSAQAAVT AIRGLSPQFE RLKTSIQGNL FAGLDKQITR VADVLLPKLQ QRLPAIATAF
NGLAKGVAGG LSSDGFTSGL DAALSHTATG IAGLAKGMRP LFSGLGQVIG AFAPSLTAAG
QAAGGLAARF GEFISKAAET GQLASFVDGV KTALSQVGGI LSNLGSVVAS VFSAASSSGG
GLLATLETIT GAAREFFSSL EGQEALSGFF GGIQSVVRAV LPTLQNLASG IGSVLGPAVG
QIATILGPAL ETLSGSLVDG VAKLLPGLMP VAEALASIVT AAAPLLGVAG QLGAILGGIL
GSALSMVASL FQALVPPAVQ IAQILLASLM PAWRSIESAL QTVIAAVLPV IQAFLSVNQA
LAPILGLIVR VAAAILSGLV KGLIALITPS LTITKIFVGA LAKGIATVYG WLKEKLGPAA
AWLASVWNEK VSPALSKVSA WLSEKFSAAA ASAWSWIKEK LAPLGQKLAT IWSEKLAPAI
QKVGLWFREK FVPAAQQVWT WIQEKVIPVV SKLVRWFVSK LVPGIERVVE WLVDLAGWFL
DLAVDIGTAV GKAIRFWGRF IAFIKALPGK VIGFLKGLPA KFVQIGKNIV SGIVRGIKQA
AGRIKDAAVG AAKSAYEGAK DFLGINSPSR LMAWLGSQMG AGVEVGLDGS VGDVVSASRG
LARAAYDPWR GFTPHPKSRT DTDGPGGVNV SVRIGEREVA DMVVDAVRAR PEAVASTVAH
GTRLSNLRR