Gene Slin_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2048 
Symbol 
ID8725786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2477882 
End bp2479501 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content51% 
IMG OID 
ProductGeneral substrate transporter 
Protein accessionYP_003386892 
Protein GI284036962 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.617643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.599548 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCCA AACCTCAATA CCTAACTGAA AAAATGATAG TAGAACAAAC CCCCAAATCT 
GAAGCCGGTA CCAAAACGTT ACTGAGCGTA ATTGGTGCCT CTTCGCTGGG AACCCTCATT
GAATGGTACG ATTTTTACAT CTTTGGCAGT CTGGCCGCTA TTCTGTCCAC ACAATTTTTT
CCAAAAGATA ACCCAACGGC CGCCTTGCTG TCTACGCTGG CAACGTTTGC CGCCGGTTTC
ATTGTCCGTC CGTTTGGAGC GTTGGTATTT GGTCGGTTAG GCGACCTCGT TGGCCGGAAA
TATACGTTTC TGGTGACGCT GGTGCTGATG GGTGGATCAA CATTTTTTAT CGGGCTGATT
CCGGGTTATG AGACTATCGG TTTCTGGGCA CCGTTGCTTG TGCTGCTGCT TCGCCTGATA
CAGGGGCTGG CGCTGGGAGG TGAATACGGC GGAGCAGCTA CGTACGTAGC CGAATATGCT
CCTAAAGGTC GTAGGGGTTA TTATACCAGC TTTATTCAGA CAACGGCAAC CTTAGGACTG
TTCGTGTCGC TGGGCGTGAT TGTGGCAACC CGGCAGGTGG TGGGCGTGGA GGATTTTGCC
AAGTGGGGCT GGCGTATCCC CTTTGTCTTG TCCGCTTTAC TGGTAGGCGT TTCTATTTAT
ATACGCTTGA AAATGTCGGA GTCGCCGGTG TTTACCAAGC TTAAATCGGA AGGGAAAACA
TCCAAGAACC CACTAGCCGA GAGCTTTGGC AAGCGCGAAA ACCTGAAAAT GGTCTTACTG
GCCTTGTTTG GGGCTACTAT GGGGCAGGGT GTTATCTGGT ACACAGGTCA GTTTTATGCC
CTGTCGTTTA TACAGAAAGC CTGTAATGTC GAATTTGTCC AATCTAACAT TGTCGTGGCG
GTGGCCCTGC TGATTGCCAC CCCCTTCTTC GTAATATTTG GTGGTCTGTC GGACAAAATA
GGCCGTAAAG GCATCATGCT GGCCGGTATG GCGCTGGGCA TTCTGACCTA TCGCCCAATT
TACGAGAAGA TGTATAGCCT GACGGATTTG AAGGCGAAAC AGGAACTGAC CGAACAGACG
AAAGTAGACG TTAAAAAAGC GTTGGTTGCG AACACCGCCG ATTCACTGAT CACCACAACG
ACGACCAAAG CCTTCGCCGA TGGTACCGCT TACAAGGAAG TGTCGAAACA GACCGTTCCG
GCCGATGCGT CGGTGGAGAA GCCCAAACCC GAGGTAGTTA AATCGGTGAC GATGTCGACG
GGTGATTTAA TGATGATGAT TTTTCTGGTG CTGCTTCAGG TGTTGTATGT AACGATGGTA
TACGGCCCGA TAGCCGCCTT TCTGGTTGAG TTGTTTCCAA CCCGGATTCG TTACACATCC
ATGTCGTTAC CCTACCATAT TGGCAACGGT GTTTTTGGCG GGTTGGTACC GTTTATTGCA
ACGGCGCTGG TAGCAACGGC TACGAAGGCG AATGAAACTG CGGCAGCGGC CGGTGTGGCT
CCCGTTGTTT CCAAAGCCTA TCTGGAAGGG CTCTGGTACC CGATTATTGT GGCCAGTGTG
TCCTTTATAA TTGGGTTGTT ATATCTGAGT AACCGCGCTA AAGCCGTCGA GCGCGACTAG
 
Protein sequence
MNSKPQYLTE KMIVEQTPKS EAGTKTLLSV IGASSLGTLI EWYDFYIFGS LAAILSTQFF 
PKDNPTAALL STLATFAAGF IVRPFGALVF GRLGDLVGRK YTFLVTLVLM GGSTFFIGLI
PGYETIGFWA PLLVLLLRLI QGLALGGEYG GAATYVAEYA PKGRRGYYTS FIQTTATLGL
FVSLGVIVAT RQVVGVEDFA KWGWRIPFVL SALLVGVSIY IRLKMSESPV FTKLKSEGKT
SKNPLAESFG KRENLKMVLL ALFGATMGQG VIWYTGQFYA LSFIQKACNV EFVQSNIVVA
VALLIATPFF VIFGGLSDKI GRKGIMLAGM ALGILTYRPI YEKMYSLTDL KAKQELTEQT
KVDVKKALVA NTADSLITTT TTKAFADGTA YKEVSKQTVP ADASVEKPKP EVVKSVTMST
GDLMMMIFLV LLQVLYVTMV YGPIAAFLVE LFPTRIRYTS MSLPYHIGNG VFGGLVPFIA
TALVATATKA NETAAAAGVA PVVSKAYLEG LWYPIIVASV SFIIGLLYLS NRAKAVERD