Gene Slin_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3004 
Symbol 
ID8726755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3636764 
End bp3638452 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content51% 
IMG OID 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_003387814 
Protein GI284037884 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGA TTATCCCTGT CTTGCTGGTG TCAATGGTCT CTACCGCGAC CCTGGCACAA 
CCTAAGAAAA CGTTTCCGGG TACAGCGAAA TCTGTTACGT CTGCTCAACA ACCGGTGCCG
GGTAAACGGC TGGCAACCCC CAAACTTGTA GTCGGAATCG TTGTTGATCA AATGCGGTAT
GATTACTTAT ACCGCTATTA CAACAAGTTC AGCGCTGGTG GTTTTCGCCG AATGATGGAC
GGTGGGTTCA ACGCTCGAAA CAACCATTAC CATTATGCCG CAACTTATAC CGGTCCCGGT
CATGCCGCTA TTTATACCGG TTCGGCTCCC GCATTGAACG GTATTGTTGG GAATGATTTC
TACGAGCGAA ACATTGGCCG GTTATTGTAT TGCGCAGAAG ATACAACCGT TAAAACGGTT
GGAAATACGG GTACGGCGGG TAAAATGTCG CCCCGTAATT TGCTCGTCAC CACCATCGGC
GATCAACTGA AACTCGCCAC CGATGGTCGG GCTAAAGTTA TTGGAATTGC TTTAAAGGAC
CGGGGCGCTA TTTTACCGGC TGGTCATGCG GCCAATGCGG CTTACTGGTA CGACTCCAAA
GATGGCAATT TCATCAGTAG TACATTTTAT CAGAATGAGT TGCCGCAGTG GGTACAGGCG
TTCAATGCCC GAAAACTGTC TGATAAGTAT TTAAGCCAAA CGTGGGAAAC GACACTGCCC
CTAAGCCAGT ATACAGAAAG CACGCAGGAC GATGAACCTT ATGAAACTGT ATTAGCCGGT
GAGACAAAGT CTGTTTTTCC GCACGCGTTT GCCATTCAGG CGGGTGGTAG TAAATACGAA
CCCCTGCGAA CAAGCCCTTA TGGGGATGAA ATCACGAAAG AGTTTGCACT GGCGGCCCTT
AAGGAAGAGA AGCTGGGGCA GGGCAATGTA ACCGATATGT TATGCGTGAG TTTTTCGTCA
CCCGATTATA TTGGCCACGC GTTTGGTACA CACGCTATTG AAACCGAAGA TCAATACATT
CGGCTGGACC GCCAGTTGGC TGACTTGTTC AATCAACTGG ATGCGAGCGT TGGTAAAGGG
CAGTGGGTAG CGTTTTTATC GGCGGATCAT GGTGTTGTCG ATGCGCCGGG TTTTCTGCAA
CAGAATCGGA TTCCGGCCGG TGTGCGGGGC TACGGCGAGG TTGGTGAAGT GGTAAAGTCG
ACCCTCGAAA AGGCATATGG ACCCGGGCAG TGGGTGCTAT CGTACATGAA TCAGCAGATT
TATCTCAACC ATGCGCTGCT GACGGAGAAG AAGGTCGCCA TGCAGGATGT TTACGAACTG
CTTCGCTCCG TGTTGCTCAA GCAGAAAGCG GTAGTTAACG TTGTCAATCT GCATAACCTT
GGCGCAGAGG CATTACCCAC ATTGCAGGAA AACCTGTTCC GGAATGTGTA TCATCCTAAC
CGCAGCGGTG ATTTTTATGT CATGCAGCAA CCCGGTTGGC TTGAAGGGCG TAACAAAGGC
ACAACCCACG GCACAACCTA TGCCTACGAT ACGCACGTAC CCTTTTTGCT TTACGGTTGG
GGCGTTCGAC CGGGGCAAAC CCTTCGCCGG ACACATATCC ATGACATCGC TCCAACCATC
ACCGCCCTAC TCGGTCTGCT GGAGCCAAGC GGGTGTATCG GTAACCCGGT AGAAGAAGCG
ATTAAGTGA
 
Protein sequence
MRLIIPVLLV SMVSTATLAQ PKKTFPGTAK SVTSAQQPVP GKRLATPKLV VGIVVDQMRY 
DYLYRYYNKF SAGGFRRMMD GGFNARNNHY HYAATYTGPG HAAIYTGSAP ALNGIVGNDF
YERNIGRLLY CAEDTTVKTV GNTGTAGKMS PRNLLVTTIG DQLKLATDGR AKVIGIALKD
RGAILPAGHA ANAAYWYDSK DGNFISSTFY QNELPQWVQA FNARKLSDKY LSQTWETTLP
LSQYTESTQD DEPYETVLAG ETKSVFPHAF AIQAGGSKYE PLRTSPYGDE ITKEFALAAL
KEEKLGQGNV TDMLCVSFSS PDYIGHAFGT HAIETEDQYI RLDRQLADLF NQLDASVGKG
QWVAFLSADH GVVDAPGFLQ QNRIPAGVRG YGEVGEVVKS TLEKAYGPGQ WVLSYMNQQI
YLNHALLTEK KVAMQDVYEL LRSVLLKQKA VVNVVNLHNL GAEALPTLQE NLFRNVYHPN
RSGDFYVMQQ PGWLEGRNKG TTHGTTYAYD THVPFLLYGW GVRPGQTLRR THIHDIAPTI
TALLGLLEPS GCIGNPVEEA IK