Gene Slin_6107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6107 
Symbol 
ID8729888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7403490 
End bp7404569 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content52% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003390868 
Protein GI284040938 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCCT GGTACTCTCT GCATTGGTTC AGCCCATCGC AATGGCAGCA GTTCAAACTG 
GCGCACCCGC TGGCGCTCTG GCTGATTCCC GCTGTCTTGC TGCTGATTGC CATACGTTAC
TATTTATCCA GAAAATCGCG GCAACGACTG AAGATGTCCC TTGGTCAGAT CTCTGCCGAA
CCGGGGTCGT GGATGGGTCG GCAATCGGTG CAGTCTTTGC TGAGCCTGGG GCGGTATCTG
CTGCCGTTGT GTATGTTTCT GGGTACCGCC TGCCTGCTCA TTGCGCTGGC ACGTCCACAA
ATTATCCGGG AACTACGGGA GGAACAGTCA GAAGGTATTG ACATCATGCT GGCGATGGAC
GTATCGGTAT CCATGAGCGA ATCGGATATC CTCCCTACCC GGTTGGCTGC CGCCCGACGG
GTAGCGCAGG CATTTGTCAG GGGCCGCCGG AACGACCGTA TCGGCCTGGT TATTTTTGCG
GGAGAAGCGT TTTCGTTGTG CCCGCTTACA ACGGATTACA ACCTGCTGAA CCAGTATCTC
AACGACCTTA ACGATGGCAT GATCCGCACA TCCGGAACAG CCATTGGTGA TGCGCTGGCC
CGGTGCATTA ACCGTATGCG CGACCGTCCG GCTGCTTCCT CAGACACAAC TCAGGCCAAA
ACCGAACAGT GGAAGTCAGA GCGAAGCAAA GTAATTATTT TGTTGAGCGA TGGCGACAAT
ACGGCGGGTA ATCTGGACCC GATTACGGCC GCAAGCCTGG CGAAGGCATT TAACATCAAA
ATATATACCA TAGCCGTTGG CCAACCAGTA GCATCAGCCT CCGAAGCGTC TACGGTTGAC
GAAGGTATTC TGAAAAAGAT AGCCACAATA GGTAAAGGGA GTTTTTTCCG GGCGGTAGAC
AGTGGCCGGT TAAAAACGGT TTTTGCGCAA ATCAGCCAGC TCGAAAAAGC TCCGGTTCGC
GTTCGGGTGT ATGAAGATAT TCAGGATTAC TACCGGATTT ATATGTACTG GGGAATCACG
TTTTTATTGG GCACCCTGCT ATTGAAAAAC ACAATTTTTG GTAACGTGCT GGAAGATTGA
 
Protein sequence
MKPWYSLHWF SPSQWQQFKL AHPLALWLIP AVLLLIAIRY YLSRKSRQRL KMSLGQISAE 
PGSWMGRQSV QSLLSLGRYL LPLCMFLGTA CLLIALARPQ IIRELREEQS EGIDIMLAMD
VSVSMSESDI LPTRLAAARR VAQAFVRGRR NDRIGLVIFA GEAFSLCPLT TDYNLLNQYL
NDLNDGMIRT SGTAIGDALA RCINRMRDRP AASSDTTQAK TEQWKSERSK VIILLSDGDN
TAGNLDPITA ASLAKAFNIK IYTIAVGQPV ASASEASTVD EGILKKIATI GKGSFFRAVD
SGRLKTVFAQ ISQLEKAPVR VRVYEDIQDY YRIYMYWGIT FLLGTLLLKN TIFGNVLED