Gene Slin_3988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3988 
Symbol 
ID8727746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4794933 
End bp4796261 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content54% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003388777 
Protein GI284038847 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.74321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0290381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAAG TAATCGGCAA AGACAAAAAG ACGCGGGTGC GCTACAGCAT GCTGGCACTG 
GTCTTCATCA ACGTAGTGAT CAACTACCTC GACCGGAGCA ATATTTCGGT GGCGGGGACT
GCGCTGAGTA AAGACATGGA CCTGTCGTCG GAACAGTTGG GGTTCATTTT CTCGGCTTTT
GGCTGGACCT ACGCCCTGCT ACAAATACCG GGCGGCCTTA TCGCCGACCG CTTTGGTCCC
CGCATTCTCT ATGCATTTTG TCTGATTACC TGGTCCTTGG CCACCGTTTG TCAGGGCTTT
GTTCGGGGGT TTGCCAGTCT GTTTTCGCTT AGGCTGGCAA CGGGCGCGTT TGAAGCCCCT
TCCTACCCCA TCAACAACCG CATTGTTACA AGCTGGTTTC CCGAACACGA ACGGGCTTCG
TCTATTGCCT TGTATGTTTC GGGACAGTTT ATCGGCCTTG CGTTTTTAAC ACCCGTACTG
ACTTATATCC AGAGTCAGTT CGGGTGGCAG GGTTTGTTCG TGTGTACCGG TATCGTTGGG
CTGATCTGGG GCGTTATCTG GTACCTCTTT TACCGCGACC CGCTCGATCA TCCGAAGGTG
AACGACGCCG AGCTGGCCTA CATCGAAGAA GGGGGTGGCC TGTTCAGAAG TCGGCAGGCG
GGTACGAATA AAGCGTCTGT CTGGAGCTGG GTAAACGTGA AGCAGGTGTT TTCCTCCCGC
ACGTTATGGG GAGTTTACAT CGGGCAGTTT GCCGTTAACT CCATGCTCTG GTTCTTCCTG
ACCTGGTTCC CCACCTATCT GGTCAAATAC CGGGGGCTGG ATTTCATCAA GTCGGGCTAT
CTGGCATCGG TACCTTTTCT GGCGGCCTGT GCGGGTCTGC TCCTCTCCGG CTTCGTCTCC
GACAGACTGG TGAAGCAGGG GAAATCGGTA ACGATGGCGC GTAAAGCACC GATCATCATC
GGTCTGCTGC TGTCGATCAG TATTGTCGGG GCCAATTACA CGAACGATAC CGCATTGATC
ATCGCCTTTA TGGCTTTGGC TTTCTTTGGC TCGGGTATGG CGTTGATCTC CTGGGTGTTC
GTATCTATTC TATCACCCAA ACATCTGATT GGTCTAACCG GTGGCGTGTT CAATTTCATG
GGCAATCTGG CGTCCATCGT AGTACCTATC GTGATTGGCT ATCTGGCCAA AGACGGTGAT
TTCAAACCAG CGCTCGTCTT CGTCGGCGCC CTGGGCCTGA TTGGAGCCTG TTCTTACATA
TTCCTGGTGG GCAAAATAGA ACGGGTCGTG ACTCATGACC CGCAGGAAGG GGTCTTTGCG
GGGGAGTAA
 
Protein sequence
MEQVIGKDKK TRVRYSMLAL VFINVVINYL DRSNISVAGT ALSKDMDLSS EQLGFIFSAF 
GWTYALLQIP GGLIADRFGP RILYAFCLIT WSLATVCQGF VRGFASLFSL RLATGAFEAP
SYPINNRIVT SWFPEHERAS SIALYVSGQF IGLAFLTPVL TYIQSQFGWQ GLFVCTGIVG
LIWGVIWYLF YRDPLDHPKV NDAELAYIEE GGGLFRSRQA GTNKASVWSW VNVKQVFSSR
TLWGVYIGQF AVNSMLWFFL TWFPTYLVKY RGLDFIKSGY LASVPFLAAC AGLLLSGFVS
DRLVKQGKSV TMARKAPIII GLLLSISIVG ANYTNDTALI IAFMALAFFG SGMALISWVF
VSILSPKHLI GLTGGVFNFM GNLASIVVPI VIGYLAKDGD FKPALVFVGA LGLIGACSYI
FLVGKIERVV THDPQEGVFA GE