Gene Snas_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3041 
Symbol 
ID8884240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3207148 
End bp3208473 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content68% 
IMG OID 
Productsodium:dicarboxylate symporter 
Protein accessionYP_003511805 
Protein GI291300527 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0982219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCGT TCCTGCGAAA GATCCCGTTC TCTGCCCAAC TGCTGGCCGG TCTCGTCGTC 
GGCCTCGGCC TGGGCTACCT CGCCCGCACC GCCGACCTCG GCTGGCTGAC CACCACGCTG
CAACAGGTCG GCGACCTGTT CGTCCAACTG CTGAAACTGG CGGTGCCGCC GCTGGTGTTC
ACCGCGATCG TCATCAGCAT CGCCAACCTG CGCAAGGTCT CCAACGCCGC GCGCCTGGTC
GGCAAGACCA TCGGCTGGTT CATGATCACC TCACTGATAG CCGTCGCGGT GGGTCTGGGC
CTCGGCCTGC TCACCAACCC CGGCTCCGGT GTGGACATCT CCACCAAGGG CGCCGAGGCG
CCCGACCACG CCGGTAGCTG GACCGACTTC ATCACCGGCA TCATCCCCAC CAACATCGTG
GACTCCTTCG TCCAGGTCAA CGTGCTGCAG ATCGTGTTCA TCGCGATCGT CGTGGGCGCG
GCGGCCGTCG CGGTGGGGGA CAAGGCGAAG CCGTTCCTGT CCTTCAACCA GTCGCTTCTG
GACCTGGTGC AGAAGGTGCT GTGGTGGATC ATCCGCTTGG CGCCCATCGG CACCGCCGGA
CTCATCGGCA CCGCCGTGGC CACCTACGGC TGGAGCCTGC TGGCCCCGCT GGCGACCTTC
AGCATCGACG TCTACGTCGG CTGCCTCATC GTCCTGTTGG GCGTCTACCC GCTGCTGCTG
GGCCTGGTCG GCCGGGTCAA CCCGGTGACG TTCTTCCGCA AGTCCTGGCC CGCCATCGAA
CTGGCCTTCG CGTCGCGCTC CTCGGTGGGC ACCATGCCGC TGGCGCAGCG CATCGTCACC
AAACGCCTCG GCGTTGACAA AGACTACGCG TCCTTCGCCT CCCCGTTCGG CGCCACCACC
AAGATGGACG GTTGCGCCGC GATCTACCCG GCGCTGGCGG CGATCTTCGT CGCGCAGGTC
TTCGGCGTGA ACCTGTCCAT AGGGGACTAC CTGCTGATCG CCTTCGTGTC GGTCGTGGGA
TCGGCGGCCA CCGCCGGACT CACCGGCGCG ATCGTCATGC TCACCCTGAC GCTGAGCACG
CTGGGCCTCC CGCTGGAGGG CGTCGGCCTG CTGCTGGCCA TCGACCCGGT GCTGGACATG
ATCCGCACCG CCACCAACGT GGCCGGTCAG ATGGTGGTGC CGGTGCTGGT GTCGCGCGGC
GAGAAGACCC TCGACGTGGC GGTGTTCAAC GCCCCCAACC AGCCGCTCGA CGGCTCGGAC
GCGGTCCAGC GCCCCGAGCG TGAGACCGGC GTGGTGCGCG AACCCGAACC GGCCTTCGGT
TCCTGA
 
Protein sequence
MLAFLRKIPF SAQLLAGLVV GLGLGYLART ADLGWLTTTL QQVGDLFVQL LKLAVPPLVF 
TAIVISIANL RKVSNAARLV GKTIGWFMIT SLIAVAVGLG LGLLTNPGSG VDISTKGAEA
PDHAGSWTDF ITGIIPTNIV DSFVQVNVLQ IVFIAIVVGA AAVAVGDKAK PFLSFNQSLL
DLVQKVLWWI IRLAPIGTAG LIGTAVATYG WSLLAPLATF SIDVYVGCLI VLLGVYPLLL
GLVGRVNPVT FFRKSWPAIE LAFASRSSVG TMPLAQRIVT KRLGVDKDYA SFASPFGATT
KMDGCAAIYP ALAAIFVAQV FGVNLSIGDY LLIAFVSVVG SAATAGLTGA IVMLTLTLST
LGLPLEGVGL LLAIDPVLDM IRTATNVAGQ MVVPVLVSRG EKTLDVAVFN APNQPLDGSD
AVQRPERETG VVREPEPAFG S