Gene Slin_0358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0358 
Symbol 
ID8724086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp454680 
End bp458024 
Gene Length3345 bp 
Protein Length1114 aa 
Translation table11 
GC content55% 
IMG OID 
Productcoagulation factor 5/8 type domain protein 
Protein accessionYP_003385221 
Protein GI284035291 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACATC TTGCCTCTCT TTGTATAATC ATACCACTCG TTGTTCAGAC CAGTTTCGGT 
CAGGTGAAAA AACAGAATCT GCCTGGTACG GGCGGGATCA ACACGCTTCA GCAACAGTTT
ATTACACCAC CCGATGCCGC CAAACCGCGT GTGTGGTGGC ACTGGATGAA TGGCAACATA
ACTAAAGAAG GCATTCAAAA AGACCTCGAA TGGATGAAGC GCGTGGGTAT TGGCGGCTTT
CAGAATTTCG ACGCCAGCCT CATGACGCCC AACGTGGTAC CTAAAAAGCT GGTGTTCATG
ACACCCGAGT GGAAAGATGC ATTCAGGTTC ACTACCGAAC TGGCCCGTAA ACTACAGCTC
GAAATGGCCA TTGCCGGGTC GCCGGGCTGG AGCGTGACGG GTGGCCCGTG GGTGCCCCCT
GGCGATGCGA TGAAAAAATA CGTTTGGACC GAAACCCGGG TAGCGGGTGG TAAAGCATTT
GCTGGTAAAC TGCCCCAACC AGCGGCTACA ACGGGTAATT TTCAGCATAT ACCTCTGCCC
ACCGGGGGAG GAGGCTTCGG TGGAACGGTT GGTATACTGC CAACATTCTA TCAGGATGCC
GCCGTTATAG CTTATCGTCT TCCTGAAAAC GAAACCCCTT TGTCGTCCCT GAATCCTAAA
GTTACATCCA GCGGGGGGAC ATTTAATCTG GCTGAACTGA CCGATGGTGA CCTGACCAAC
GCCAGGCCAC TGCCACCCCT GGAAGTTGGG CAGGATATGT GGATTCAGTA TGAGTTCGAC
CGGCCACAAA CGTTCAAGGC ATTAACAATT GTTGGGGCTA CCAGCGGAGG GGCTCTGGCT
GAGTTCACTG GCGCACCCAA TAACCGGGCT TTACAGGTAA GCGACGATGG TATTAACTTC
CGGGAGGTTC GAGCGATTCC GGGCAGTACG GTGGCGCAGA GTACGTTGAG CTTCCCATCG
ACAACTGCAA AATACTTTCG CTTTACCTTT AAAACATTAC CCCCCGCCGG TAATCGCTTT
GCCGCGTTGA TGGGTGGAGA GGACAAACCG GGTAAGCCCG AGGGGGTCAG CGTTGCCGAA
CTGATGCTGC ATAACACCGA CCGTATCGAT AAGGTTGAAG AAAAAGCGGG TTTCAGCCCC
TGGTGGGAAG AAAAACTGCC TGCTGCATCG ACCGCAGCCG ACGCTATTCT GGTAGAGAAT
GTGGTGGATT TGACCTCGAA GATGAATGCC GATGGTAGCC TGAACTGGAC CGCTCCCGCC
GGAGAATGGG TCGTGGTTCG ATTGGGTTAT TCGCTTACTG GTCGGCAAAA TCACCCGGCT
TCGCCCGAAG CGACGGGGCT GGAAGTTGAC AAACTCGATA AGGTAGCCGT TCGGAAATAC
ATCGACACTT ACCTCGACAT GTACAAAGAT GCTACGGGTG GCCAGATGGG TAAGCAGGGG
CTGGAATACA TGGTGCTGGA CAGCTACGAA GCCGGCGCTA TGACCTGGAC CAAAGCTATG
CCCGACGAAT TTGCCAAACG GCGCGGTTAC AGCCTCATTC CCTGGCTTCC GGTCCTGACG
GGGCGGGTCG TGAAAAGCAT TGACGCCAGT GAAAAGTTTC TGTGGGATTA CCGCCGAACA
ATCGGCGAAC TGATTTCCGA CAACCATTAC GACGTCATTG GCGATGCGCT GCACCAGCGG
GGTATGAAAC GCTATACGGA ATCGCACGAA AACAAACGAA TCTACCTGGC CGATGGTATG
GACGTGAAAC GAAACGCCGA TATTCCCATG TCGGCTATGT GGACGCCCGG TAGTCTGGGG
CAGGGGAGCA ACGAAGAGCC CCGCAGCCAA GCCGATATTC GCGAGTCGGC ATCGGTGGCG
CATATTTACG GACAGAATCT GGTGGCGGCT GAGTCGATGA CATCCGTGGG CAATGCATTC
AGTTTCCACC CCGAGAAGTT GAAACGCACC GCCGATCTGG AAATGGCCTC CGGGCTGAAC
CGCTTTGTGG TGCATACCTC CGTACATCAG CCGCTGGACG ATAAAATGCC CGGCTTCTCG
CTGGGGCCCT TCGGGCAGTA TTTTACCCGC CACGAAACCT GGGCCGAGCC CGCTAAAGCG
TGGACCAGCT ATCTGGGCCG GAGCTGTTTC CTGCTTCAGC AGGGCAAACC GGTTGTCGAT
GTGCTGTACT ATTACGGGGA GAATAACAAC ATCACCCAGC TTTGCGCTAC CAAACTGCCG
GATTTTCCAT CGGGCTACGA ATATGATTTT GTGAATGCAA CGGCACTCCG GACGGCCCTG
CGTGTGGAAG GTGGGAAGAT CGTGGCGAGG AGTGGTCAGC CGTACCGGTT GCTGGTGCTG
GACGCGTCGG CGCGTTACAT GACCCTGCCG ACGTTAAAGA AGTTGGGCGA ACTGGTTAAA
GCTGGTATGC ACGTAACGGG TACTAAGCCC GAGCAGTCGC CGAGCCTGAG CGATAATCCC
GCGGAATTCA CCGCCCTGGT GAACCAGATA TGGAACCAGC CAACCGTGTC GACCAAACCG
ATTGAGGCCG TTTTGAGCGA GATGGGTATT GCCAAAGATG TGGACATATC GGGCGCTGCC
GCCGAAATCC TCTATGTACA CCGCCAAACG GCCGGGCAGG ATATCTACTG GCTCAATAAC
CGGAGCGACA ATACGAACGA AGCGCAGATC AGCTTTCGGG TAACGGGCAA GGTGCCTTTG
TTATGGAATC CGCAAACGGG CAAAACTGAA ACGGTCTCGT ATCAGGTGAA GGGCGACCGC
ACGGTTATTC CACTGCGATT CGAGTCGTGG GATGCGTATT TTATCGTATT TGGCGATAAG
ACTCCCGCAA TGGCTTACAC AAAACCAGCC ATTAAGGAGT CGGCGGTGGC GCGGATGGAC
GGTGCGTGGA ATATCCGTTT TCAGGATGGG CGGGGCGCTC CGCAAGGGGC CTCTTTCAAT
AAACTAGCGT CGTGGACAGA CAACACCGAT GCCGGAATAA AGTACTTCTC TGGCACGGCC
AGCTACGAGA AGTCATTCGA GTTAGCGAAC CTCAGTAAAG AAGCAGCGTA TATGTTGGAT
CTGGGCGATG TGAAGAACAT GGCCGAAGTA ATCGTAAACG GTAAAAACAT GGGTATCGTC
TGGAAAAAGC CCTTCTGTCT GCCCATCACC GGTGCTCTGA AGACAGGTAC CAACACCGTA
CAGATCAAGG TGACGAACCT CTGGGTAAAC CGGTTGATAG GTGATGCGCA GCCGGGTGTA
ACGAACAAAA TAACGTTCAC GACTATCCCG TTCTACCGTG CCGATTCGCC CCTGTTGCCG
TCGGGTCTGT TGGGGCCGGT ACAGGTACTG TTGGCTAGGC CCTGA
 
Protein sequence
MKHLASLCII IPLVVQTSFG QVKKQNLPGT GGINTLQQQF ITPPDAAKPR VWWHWMNGNI 
TKEGIQKDLE WMKRVGIGGF QNFDASLMTP NVVPKKLVFM TPEWKDAFRF TTELARKLQL
EMAIAGSPGW SVTGGPWVPP GDAMKKYVWT ETRVAGGKAF AGKLPQPAAT TGNFQHIPLP
TGGGGFGGTV GILPTFYQDA AVIAYRLPEN ETPLSSLNPK VTSSGGTFNL AELTDGDLTN
ARPLPPLEVG QDMWIQYEFD RPQTFKALTI VGATSGGALA EFTGAPNNRA LQVSDDGINF
REVRAIPGST VAQSTLSFPS TTAKYFRFTF KTLPPAGNRF AALMGGEDKP GKPEGVSVAE
LMLHNTDRID KVEEKAGFSP WWEEKLPAAS TAADAILVEN VVDLTSKMNA DGSLNWTAPA
GEWVVVRLGY SLTGRQNHPA SPEATGLEVD KLDKVAVRKY IDTYLDMYKD ATGGQMGKQG
LEYMVLDSYE AGAMTWTKAM PDEFAKRRGY SLIPWLPVLT GRVVKSIDAS EKFLWDYRRT
IGELISDNHY DVIGDALHQR GMKRYTESHE NKRIYLADGM DVKRNADIPM SAMWTPGSLG
QGSNEEPRSQ ADIRESASVA HIYGQNLVAA ESMTSVGNAF SFHPEKLKRT ADLEMASGLN
RFVVHTSVHQ PLDDKMPGFS LGPFGQYFTR HETWAEPAKA WTSYLGRSCF LLQQGKPVVD
VLYYYGENNN ITQLCATKLP DFPSGYEYDF VNATALRTAL RVEGGKIVAR SGQPYRLLVL
DASARYMTLP TLKKLGELVK AGMHVTGTKP EQSPSLSDNP AEFTALVNQI WNQPTVSTKP
IEAVLSEMGI AKDVDISGAA AEILYVHRQT AGQDIYWLNN RSDNTNEAQI SFRVTGKVPL
LWNPQTGKTE TVSYQVKGDR TVIPLRFESW DAYFIVFGDK TPAMAYTKPA IKESAVARMD
GAWNIRFQDG RGAPQGASFN KLASWTDNTD AGIKYFSGTA SYEKSFELAN LSKEAAYMLD
LGDVKNMAEV IVNGKNMGIV WKKPFCLPIT GALKTGTNTV QIKVTNLWVN RLIGDAQPGV
TNKITFTTIP FYRADSPLLP SGLLGPVQVL LARP