Gene Slin_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2998 
Symbol 
ID8726749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3629514 
End bp3630755 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content48% 
IMG OID 
ProductNa+ dependent nucleoside transporter domain protein 
Protein accessionYP_003387808 
Protein GI284037878 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGTT TTACCGGCCT ACTAGGCATT GTTTTAATTC TGGGTATTGC TTATGCCCTT 
TCTGATAATC GTAAAGCCAT TAATTATCGT ACTGTTGGGG TAGGGCTTGC GCTTCAATTT
GGTCTGGCTG TTTTCGTCTT AAAAACGGAC GTTGGACAGC GTCTCTTTCA GGGTCTTGGC
TATTATGTTG ATCGATTGCT TCAAAAAGCC AATATTGGTG CCGAGTTCGT TTTCTCCTCT
TTGGTGAGAC CGGATATACT TACAAGAGCT TTTGGCCCCG AGAACAGCTT TATTTTCTTC
TTCAAGGTTG TTCCAACCAT CATTTTTGTA GCCGTTTTGG TCAACATTTT TTATCACCTT
GGCATCATGC AGCGCATCGT GGCGGTGATG GCCAAAGCCA TGAAATGGTT GATGGGCGTG
AGTGGAGCCG AAGCGTTGTC CAACGTTGCC AGTACATTTG TTGGGCAGGT AGAAGCGCAG
ATCATGATTA AGCCGTACCT CAATGGCATG ACCAATTCTG AACTACTCGC CAGCATGACC
GGTTCGTTTG CTTGTATAGC CGGGGGTGTA CTAGCGGTTT ACATTTCTTT GGGCGTGCCT
GCCCCTTACC TACTGGCGGC CAGCATTATG GCTGCACCGG GGGCGTTGGT AATTAGCAAG
ATTGTAATGC CGGAAACGCA GGTATCAGAA ACCCAGGGAA CGGTTAAAGT TGAAATCAAA
AAAATCTACG CGAACCTGCT CGATGCCATT GCCGCTGGTG CTAGCGAAGG GCTTAAAGTA
GGGTTTAATG TAGTGGCCAT GCTGATCGGT TTTATCGCTC TGATCGCACT GATCGATAGC
CTGATGTTTC GCATAGGTTT CTACATTTTC AACGTCGATT TTCTGAGCCT TAACTACCTG
CTTGGCAAAC TTTTCTCTCT GTTTGCCTGG GCAATGGGCG TACCAGCGAA AGACATCGAA
GCCGCTGGTG CCCTGATGGG GACGAAAATG GTGGTTAATG AATTTGTTGC GTATCTGGAC
TTAGTGAAGA TAAAGCAGAC GCTTGACCCT AAAACAATCG CTATCGTCAG CTTTGCTCTG
TGTGGCTTTG CCAACTTTAG TTCGATTGCC ATTCAGGTCG GGGGCATTGG CGAACTAGCG
CCAAAACGCC GGAGCGATCT GGCGCGCATT GGCTTCAAAG CACTTATTTG CGGTACGCTG
GCAAGTTATA TGTCGGCTAC CATCGCTGGA TTGCTTTTGT AG
 
Protein sequence
MERFTGLLGI VLILGIAYAL SDNRKAINYR TVGVGLALQF GLAVFVLKTD VGQRLFQGLG 
YYVDRLLQKA NIGAEFVFSS LVRPDILTRA FGPENSFIFF FKVVPTIIFV AVLVNIFYHL
GIMQRIVAVM AKAMKWLMGV SGAEALSNVA STFVGQVEAQ IMIKPYLNGM TNSELLASMT
GSFACIAGGV LAVYISLGVP APYLLAASIM AAPGALVISK IVMPETQVSE TQGTVKVEIK
KIYANLLDAI AAGASEGLKV GFNVVAMLIG FIALIALIDS LMFRIGFYIF NVDFLSLNYL
LGKLFSLFAW AMGVPAKDIE AAGALMGTKM VVNEFVAYLD LVKIKQTLDP KTIAIVSFAL
CGFANFSSIA IQVGGIGELA PKRRSDLARI GFKALICGTL ASYMSATIAG LLL