Gene Slin_4109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4109 
Symbol 
ID8727868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4947115 
End bp4948845 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content49% 
IMG OID 
ProductRhs element Vgr protein 
Protein accessionYP_003388895 
Protein GI284038965 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0273732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGTTT ACTCTCCTGT TGTCGAAACG ACCCTGCTCA TCGAGGGAAA AAAAATTCCA 
ACGTTCCACT CCATAACCTT ACAGCAGTCT ATTCATACCA CACATGAATT CCGGGTTGTT
TTTGAACATG AGTCCGTTGA CGAATTAGTG GTGCTGTTCT CCGATCAACC TGAAAAATTG
AATCGAAAAT CAATAGAGCT TACCGTCAAA GCAGCTGGTG AGGGGCCTCC GCTGCAGTTT
AAGGGCGTGA TTACCCAAAC CGAATTAAAA CAGCAGGATG ATGGCTACTG GGGGCAAATG
ATCATAAGCG GACACGGTCA TTGTGAATCC CTGCTGACGA TAGCCGGTAC CCAGACGTTC
ACCGATTTAT CGATAACCGA TATTGTAAAT AAATGCCTGA GTAGCTACAC GCAGGTGAAA
AATGTGTCTG GCAGCGTAAA ACCCGCCAAG CTACCGTTTT GTGTTCGCTA TACAGAGTCG
GTCTGGCACT TCATAAAGCG GCTGGCATAC GATTTTGGGG CCTGGTTCTA TTACGATGGG
TCAGTACTTC GATTCACGAC CAGCCCGGGT ACAACTTCTA CGTTAAACCT GACCTTTGGC
GCCAATTTAA CGCACTTTCG CACGGGCGTC CGAGCGGTAC CCGCTTACTT TAAACAGTAC
GACTATTTAG CCGAAGAAGA CAAACGGCTG GAATCGGAAG CAGCCAAAGA CAAAACTCCT
TATGACAAGC CGGAAACGAG TGGTGTCATT ACGCCCCATC CGGCCCAGAC AGCTGCCGAT
ATGCCCGATT ACCGCGACAG TCGCCATGCT TCCCTAACCG CCGAAGAAAA ATATATGGAA
GGGCAAGCTC GGGTTCCGGG CCTGTTTCCG GGCAGTAAAA TCGTGGTTAA GGATTCTGAA
AGAGGAAAAG GCGGGCAATC AGCCCCTTAC CTCGTCACCG ACATTGTTCA CTACGTAAGT
GGTGTTGGCG AGTACACAAA CAGGTTTCGG GCCATTCCGG CCGATGTAGC CGCTATGCCC
GTGCGCAAAT TGGTTCGCCC GCTGGCCCAA ACGCAGATTG GTCAGGTTAC TGACATTAAA
GACCCCAAAG GCATGGGGCG GGTAAAGGTA AGGCTGCTTT GGATGAGTGG CTCGGAATCG
ACCCCCTTTA TCCGGATGAC TCAGCCGCAC TTCGGGGTTC ATACGGATAA CAAAAAAACG
CGCGGCTTTC AATTTGTGCC AACCATTGGC GACCAGGTCA TGGTCGGCTT TGAGTACAAC
AACCCCGAAC GTCCCTTTGT GATTGGCGCC TTACCCCACG GTAAAAACAG TGGTATGGAC
ACCAGTAAGC CCGATGAAGA AAAGCACATC AGCGTAGGAA GTGGCAGTAC GCTGACGTTT
ATTGAAAAAC CCAGCGTGAA AGAAATTCAT CTGCAAGTCG ATGAAAAGAA CTTCGTCAAG
ATCTCCGTAC CGAGTGCGGG TGGTGATATA ACGATCAACT CTTCGAAAAA TATTGTTGTC
AAAGCCACCG CGAAAGTGAC CATCGAAGCT CCCGAAATCG TGTTATCCGG AAACACCATT
ACCTTAGATG CCAAACAGGC GGTCAATATT AAAGGCACAC AGGTTAAAGT AGAAGCATCA
GCCCAGATGA ACATCAAAGG AGCCATGACC GATGTGGAAG GGTCGGGTAC GCTAAACGTC
AAAAGCTCAG GCATGACAGC CATTAAAGGA TCAATGGTTA TGATTAATTA G
 
Protein sequence
MPVYSPVVET TLLIEGKKIP TFHSITLQQS IHTTHEFRVV FEHESVDELV VLFSDQPEKL 
NRKSIELTVK AAGEGPPLQF KGVITQTELK QQDDGYWGQM IISGHGHCES LLTIAGTQTF
TDLSITDIVN KCLSSYTQVK NVSGSVKPAK LPFCVRYTES VWHFIKRLAY DFGAWFYYDG
SVLRFTTSPG TTSTLNLTFG ANLTHFRTGV RAVPAYFKQY DYLAEEDKRL ESEAAKDKTP
YDKPETSGVI TPHPAQTAAD MPDYRDSRHA SLTAEEKYME GQARVPGLFP GSKIVVKDSE
RGKGGQSAPY LVTDIVHYVS GVGEYTNRFR AIPADVAAMP VRKLVRPLAQ TQIGQVTDIK
DPKGMGRVKV RLLWMSGSES TPFIRMTQPH FGVHTDNKKT RGFQFVPTIG DQVMVGFEYN
NPERPFVIGA LPHGKNSGMD TSKPDEEKHI SVGSGSTLTF IEKPSVKEIH LQVDEKNFVK
ISVPSAGGDI TINSSKNIVV KATAKVTIEA PEIVLSGNTI TLDAKQAVNI KGTQVKVEAS
AQMNIKGAMT DVEGSGTLNV KSSGMTAIKG SMVMIN