Gene Slin_0520 details

Gene Information

Locus tag: Slin_0520
Symbol:
ID: 8724248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 640953
End bp: 642167
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: nucleoside:H symporter
Protein accession: YP_003385383
Protein GI: 284035453
COG category:
COG ID:
TIGRFAM ID:
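
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 640953
END_BP = 642167
REPORTED_GENE_LENGTH = 1215    # bp
REPORTED_PROTEIN_LENGTH = 404  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the computed values agree with the reported 1215 bp and 404 aa.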


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000056918
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTCAA CGACCCGCGT CAAACTTTCC GTCATGATGT TTCTCCAGTT TTTTGTTTGG 
GGAGCCTGGT ACGGTCAGAT GAGTAAATAC CTGCTTACAC AACTTCATTC AACGGGCGAT
CAGGTCGGTA ATGCCTATGC GGCTTTCTCG CTGGCCATGA TCATCGCTCC CTTTTTCGTC
GGTATGATTG CCGACCGTTA TTTTGCCGCT CAAAAGGTGC TGGGTGTTCT TAATCTGCTG
GGTGCGGTCG TTTTGTACTT CATCACCCAA AATACTGACC CTGATAATTT TTTCTACCTC
ATTCTGGCGT ATTGCCTGAC GTTTGCGCCA ACGCTGGCCC TCACTGCCTC TATTGCGATG
CAGCAGATGA GTGTCCCCGA AAAAGAGTTT CCGGGCATTC GGGTGCTGGG TACGGTGGCG
TGGATTATCG TGACAAACAT CGTTGGTTAT TATGGTTTTG GCGATAAGGT GACCATCTTC
CAGCTATCCA TGTATTCGGC GGTTGTTTTG GGTATTTTTG CCTTCTTTCT ACCCAACACA
CCTCCCAAAG CGACGACATC TACGTCGTTC TCCCAGATTC TTGGACTGGA TGCGTTTAAA
CTGTTTAAAG ACCGGTCGTT TGCAATCTTC TTCCTGTCAT CGGTATTGAT CTGCATCCCG
CTTTCGTTCT ACTACGCTAT GGCTAACCCC TCGCTGACCG ATGGCGGTAT GCAGAATGTA
GAGAATAAAA TGTCGCTGGG GCAGGCGTCT GAAGTGATTT TCATGCTGCT GATTCCCCTG
GCCTATACGC GGCTTGGTGT TAAGAAAATG CTGATAGTAG GGCTGGTAGC CTGGATTGTC
CGGTTTATCT GCTTCGGCTA TGGCGACGGC GGCTCCGGCG AATGGATGCT CTATCTGGCT
ATCGTACTGC ACGGCGTTTG CTATGATTTC TTCTTCGTAA CGGGCCAGAT TTATACGGAC
AACAAGGCGG GCGAGAAAAT CAAATCGTCG GCGCAGGGGC TCATCTCCCT CGCTACCTAT
GGTATCGGGA TGGGTATTGG TTCCAAACTG TCGGGTATCG TGCTCGACAT GTATACCCGC
CCCGATGGCA CTAAAGACTG GCTAGCTGTG TGGCTCGTTC CGGCCGCTAT TGCCGCTGCG
GTATTGATCA TCTTTGTGCT GCTGTTTTCG GATAAGAAGA AAGCCGTTCC TAATGAGGGT
CAACTGGTAT CGTAA
 
Protein sequence
MLSTTRVKLS VMMFLQFFVW GAWYGQMSKY LLTQLHSTGD QVGNAYAAFS LAMIIAPFFV 
GMIADRYFAA QKVLGVLNLL GAVVLYFITQ NTDPDNFFYL ILAYCLTFAP TLALTASIAM
QQMSVPEKEF PGIRVLGTVA WIIVTNIVGY YGFGDKVTIF QLSMYSAVVL GIFAFFLPNT
PPKATTSTSF SQILGLDAFK LFKDRSFAIF FLSSVLICIP LSFYYAMANP SLTDGGMQNV
ENKMSLGQAS EVIFMLLIPL AYTRLGVKKM LIVGLVAWIV RFICFGYGDG GSGEWMLYLA
IVLHGVCYDF FFVTGQIYTD NKAGEKIKSS AQGLISLATY GIGMGIGSKL SGIVLDMYTR
PDGTKDWLAV WLVPAAIAAA VLIIFVLLFS DKKKAVPNEG QLVS
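
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that spacing, translate the DNA, and compute a GC percentage. It is an illustration under stated assumptions: the helper names are hypothetical, the codon table is built by hand from the standard amino acid assignments (which translation table 11 shares), and the alternative start codons allowed by table 11 are not handled because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGCTGTCAA CGACCCGCGT CAAACTTTCC"))  # MLSTTRVKLS
```

Applied to the first three blocks of the gene sequence, this yields MLSTTRVKLS, which matches the first block of the protein sequence. The last two blocks (CAACTGGTAT CGTAA) yield QLVS followed by the stop codon, matching the end of the protein.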