Gene Slin_4921 details

Gene Information

Locus tag: Slin_4921
Symbol:
ID: 8728685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5992331
End bp: 5993710
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 48%
IMG OID:
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_003389698
Protein GI: 284039768
COG category:
COG ID:
TIGRFAM ID:
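
The gene and protein lengths listed above are consistent with the start and end coordinates. A minimal Python sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
# Coordinates copied from the record above.
START_BP = 5992331
END_BP = 5993710


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose final codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1380
print(protein_length(length_bp))  # 459
```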


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.691187
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCATC GGTATTCTAT ACTATTTTTC CCGCTTCATG TAATCGTTGA TTTCCTGAGC 
CTGAATGCTG CCTTCGTCGG GGCTTACATT CTCAAATTCC AGGCTGTCGA ACCGGTTGCA
GAGCCCCCTT ACGCGTCGCT TTGGGTAGTG TTTAACATAG TATGGTTGGT TGAAATTTTA
CTGCTTAAGC CCTATATATT TCCTCGTCAA CTCTTTAAGG CAGACCACTT AGTAAAAAAA
TTATTAATTC TGATGGCTAT TCATATAGCT GTCATATCCA TATACTGGGT AGCCGTAAAA
GGGTACTACT TTTCACGTGA ACACTTACTG GTCACCTACC TGCTGTTTAC CAGTTTGGCG
GTGGCTTTCC GGTTGGGTGG ACTGGTTTTT CTGAAAGAAT ATCGGGCCAG AGGGTACAAC
AATCGTCGGT ACGTGATCGT CGGTTATGGT AAGTTGGCTG TGTCGATCCA GCGGTTTTAT
GATGCGCATC CCGAAATGGG ATTTCGTTTC CTCGGTTATT TCGATGAGCC ATCTTCCGAA
AATCAGCACC TGCTCAGGGG GAATTACGAC GATTTGCCTG CGCACATTCA GCAAGAGGGA
ATAGATTGCG TGTACTGCTG TATGCCCTAC ATTGATAATG GTCGCTTGAA AAAGATTGTT
GAAGAAGCCG AGTCGGTCGA TTACCAGGTA AAGTTGTTGG TTGACTTTCG GGGGTTTCTG
GCGCGTGGCG CATCGGTCGA ATATCACGAT TTTCTGCCGG TATTGAATGT GTCCTCGCAG
ATGCTGGCCG ATTTTCAGGT AAACACGCTC AAACGGTTAT TTGACATTCT GTTCTCGTTG
GCTGCGCTGG GATTGGGGAT GCCCATGTTA ATTATTCTGG CTATTATAAC CAAGATTACG
TCGTCTGGCC CCATTTTTTA CGCACAGGAG CGCATTGGGC AGGGAGGTAA GCCCTTCAAG
ATTTATAAAT TCCGCAGCAT GTACGTCGAT TCGGAACGGT CGGGACCGGT GTTGTCGGGG
GGCTTGCTCG ATGACCGGAT TACGCCCTGG GGACGGTTTA TGCGTAAAAC CCGGCTCGAT
GAAATGCCTC AGTTTTACAA TGTGCTGATT GGAGACATGT CTGTGGTAGG CCCCCGTCCG
GAACGACAGT ATTTTATTGA TCAGATTGTT GAAATCGCCC CCGAATACCG GTCTTTGCTG
AAAGTAAAAC CGGGGATCAC GTCCATTGGG CAGATTAAAT ATGGCTATGC GGCCAACATT
GATGAAATGG TGCAACGGTT GCGGTACGAC CTGCTGTATC CCCGACGTCG CTCTTTTTTA
TTCGATATGT GGATCATTGC CCAAACGCTT CGGGTAATGG CCCAGGGCCG TGGCAAGTGA
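
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. A rough Python sketch of that cleanup and of a GC-content calculation is shown here; the helper names are hypothetical, and the short example uses only the first block of the sequence above.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence above.
print(gc_percent("ATGAGGCATC"))  # 50.0
```

Pasting the full gene sequence into `gc_percent` gives a value to compare with the GC content field of the record.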
 
Protein sequence
MRHRYSILFF PLHVIVDFLS LNAAFVGAYI LKFQAVEPVA EPPYASLWVV FNIVWLVEIL 
LLKPYIFPRQ LFKADHLVKK LLILMAIHIA VISIYWVAVK GYYFSREHLL VTYLLFTSLA
VAFRLGGLVF LKEYRARGYN NRRYVIVGYG KLAVSIQRFY DAHPEMGFRF LGYFDEPSSE
NQHLLRGNYD DLPAHIQQEG IDCVYCCMPY IDNGRLKKIV EEAESVDYQV KLLVDFRGFL
ARGASVEYHD FLPVLNVSSQ MLADFQVNTL KRLFDILFSL AALGLGMPML IILAIITKIT
SSGPIFYAQE RIGQGGKPFK IYKFRSMYVD SERSGPVLSG GLLDDRITPW GRFMRKTRLD
EMPQFYNVLI GDMSVVGPRP ERQYFIDQIV EIAPEYRSLL KVKPGITSIG QIKYGYAANI
DEMVQRLRYD LLYPRRRSFL FDMWIIAQTL RVMAQGRGK
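
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal Python sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons are permitted; since this gene begins with ATG, the sketch does not handle alternative start codons. The function and constant names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order. Table 11 shares these amino acid
# assignments with the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a CDS up to, and not including, the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGGCATC GGTATTCTAT ACTATTTTTC"))  # MRHRYSILFF
# Last three codons of the gene sequence, ending in the TGA stop.
print(translate("GGCAAGTGA"))  # GK
```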