Gene Slin_0224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0224 
Symbol 
ID8723952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp297408 
End bp298718 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content56% 
IMG OID 
ProductXanthine/uracil/vitamin C permease 
Protein accessionYP_003385088 
Protein GI284035158 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.274555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.678538 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCAACTA CGTTTACTTC CCGTCGAACC GAAGTATTAG CCGGTATCTC TTCCTTTCTG 
GCCACCATGT ACATCATTGT GGTCAACCCG GCCATATTGA GTCAGGCCGA TTTACCCTTT
AGCGGGGTCC TGACGGCTAC CGTATTGCTG TCGTTCTTTT GCAGCCTGAT GATGGGCCTA
TACGCCCGCA ACCCCATTGT GGTGGCTCCG GGTATGGGGA TGAATGCGTT TTTCACCTTC
ACAACCGTCA AAGGCATGGG CATCCGCCCC GAAATCGCTC TGGGGGCGGT ATTCTGGTCG
GGTGTTCTGT TTCTACTGCT ATCTATTTTT AACGTGCGGT CGGCCATTGT ACGGGCTATT
CCGCAACCAC TACGCTATGC GGTTTCGGCC GGAATCGGGC TTTTTGTTAC GCTCATTGGC
TTCGAGAACG CCAGGTTCAT TGTGGCCAAT CCGGCTACGC TGGTGAGCAT CGCCCATTTC
AACGACCCTA TCGTTCTCAC GTTTGTCTTT GGCCTGCTGC TCATGAGCGT GCTGGTCGTG
CGTGATGTGC CGGGCGGCAT TATCGTCGGC ATTATCCTAA CAACGCTGGT CGCCTGGCCC
ATCGGGCGGT ACTGGGGTGA TGCCTCGGCC ATTAATTTCG GGCAGAAAAC GCTGGTCAAT
TTTCAGGGCG TTCTGGCCGC GCCCGACTTC TCGCTTCTGG GCAAGCTGGA CCTGATGGGT
TCGCTATCCT GGTCACTGTG GCCGGTTATT TTTGCCTTTG CGTTCACCGA TTTGTTCGAC
AGCCTGTCGA CCTTCGTAGG TGTTGCCGAA GCAGGTGGCT TGCAGGACGA AGACGGCCAA
CCGCGTAACC TGAACCGCTC GCTGATGACC GACGCCGTGG CTACTACGCT GGCGGGGATA
TTCGGCACCA GTCCAGGCAC GGCCTATATC GAATCGGCGG TGGGGATTGC GCAAGGGGGA
CGAACGGGCC TCACGGCAGT AGTAGCCGGT TGCTGTTTTT TGCCGTTTCT GTTTCTGTCG
CCCCTATTGT CGATCATACC AGCTATTGCC ACGGCTCCGG CCCTGGTGCT GGTGGGAGCC
TTCATGATGA AACCCATTAC GCGCATCGAC TGGAGTCAAC TCGACGATGC GCTCCCCGCC
TTTCTGGCGC TGGTTCTGAT TCCGTTCAGT TACTCCATCA CGCAGGGGCT CATATGGGGA
TTCCTTTCCT GGACCGTTAT CAAAGTTGCC GTTGGCAAGA GCCGCGAGGT ATCGACGGGT
CTCTGGATTG TCGATGTCTT TTGCGTACTG GCGTTGACGA GTGGTCATTA G
 
Protein sequence
MSTTFTSRRT EVLAGISSFL ATMYIIVVNP AILSQADLPF SGVLTATVLL SFFCSLMMGL 
YARNPIVVAP GMGMNAFFTF TTVKGMGIRP EIALGAVFWS GVLFLLLSIF NVRSAIVRAI
PQPLRYAVSA GIGLFVTLIG FENARFIVAN PATLVSIAHF NDPIVLTFVF GLLLMSVLVV
RDVPGGIIVG IILTTLVAWP IGRYWGDASA INFGQKTLVN FQGVLAAPDF SLLGKLDLMG
SLSWSLWPVI FAFAFTDLFD SLSTFVGVAE AGGLQDEDGQ PRNLNRSLMT DAVATTLAGI
FGTSPGTAYI ESAVGIAQGG RTGLTAVVAG CCFLPFLFLS PLLSIIPAIA TAPALVLVGA
FMMKPITRID WSQLDDALPA FLALVLIPFS YSITQGLIWG FLSWTVIKVA VGKSREVSTG
LWIVDVFCVL ALTSGH