Gene Slin_3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3894 
Symbol 
ID8727652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4671964 
End bp4674030 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content57% 
IMG OID 
Productcoagulation factor 5/8 type domain protein 
Protein accessionYP_003388683 
Protein GI284038753 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.813544 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGAC TCACACTTCT TGCCGCTTTC GTCAGTTTTT CTCTTTTATC GATAGCTCAG 
ACACCAGCCC CGTACGGTGC CGTTCCATCG CCCCGCCAGC TTGCCTGGCA TAAGCTCAAG
TACTATGCCT TCGTCCATTT CAATATGAAC ACCTTCACCA ATGAAGAGTG GGGACACGGC
ACCGAAACCC CCGATATGTT TAACCCCACT CAACTCGACT GTCGGCAGTG GGCGAAGGTG
GCAAAAGAAG CCGGGATGGA AGGGATTGTC ATTACGGCCA AGCACCACGA CGGCTTCTGT
CTGTGGCCGT CGAAATACAC GGAGCACTCG GTCAAAAACA GCAAATGGCG GAACGGGAAG
GGCGATGTGC TGAAGGATCT GTCGGAAGCC TGTAAGGAGT ACGGTCTGAA GTTCGGCGTG
TACCTGTCCC CCTGGGACCG CAATCACCCG GCCTACGGGA CGCCGGAATA CAACGAGGTG
TTCAAGAAAA CCCTTCAGGA AGTGCTGACC CAGTACGGCG ATGTGTTTGA GGTCTGGTTC
GACGGCGCAA ACGGAGAAGG GCCAAACGGC AAAAAACAGG TCTACGACTG GCCCGGCTTT
ATCGAAACCG TACGCAAATA CCAGCCCAAC GCCGTTATCT TCAGCGATGC CGGTCCCGAC
ATTCGGTGGG TAGGCAACGA AGATGGGTAC GCGGGCGAAA CCAACTGGAG CACGCTCAAC
CGCGATAAGG TCTACCCGGC TTATCCAAAC TACTGGGAGT TGACGCTGGG TCACGAGGAC
GGTACGCATT GGGTGCCTAC GGAGGTAAAC TGCTCCATCC GGCCGGGCTG GTATTACCAC
GCCAGCGAAG ACAACAAAGA GAAGTCGCTG GAACACCTGG TCGATATTTA TTACAGCTCC
ATCGGCCGCA ACGGCAACTG GCTCCTGAAC TTGCCCGTCG ATCGGCGTGG GCTGGTGCAT
GAAAACGACG TAAAACAGCT CATGGCGCTG AAAGCCTACA CCGACAAGGC TACGCACAAC
CTCGCCGGAG GGAAGAAAAT CACGAGCAAC AGTGTGTTCA GCAAGGCTTC GACCTTTGCC
GCCGGTAACG TGCTCGATGC TAGCCGGGAT ACCTACTGGG CCGCTGCCGA AGGCGCCAAA
CAGGCGACGC TGGATATTGA CCTGGGCAAG CCTACAACCC TCAACCGGCT GCTGATTGAA
GAGTATATTG CGTTGGGCCA GCGGGTAAAG AAGTTCTCGG TAGCCGCCTG GCAGAAAGAT
GGCTACCAGA CCATTGCCAG CGGAACGACC ATTGGGAACC GGCGGATTCT GCGATTCCCG
ACCGTTACGA CCACCAAAAT TCGGGTGAGC ATCGACGAGT CGAAAGCCAG TCCGCTGATT
CGGCATATCG AGATGTACAA CGCGCCCGAA CTGATCGTAA CGCCCGTGAT CAGCCGGAAT
AAAGAGGGTA TGGTTACGAT TGTCTGCCCC AAAACGACCG ACCCGGTCAT TACCTACACC
ACCGACGGCT CGGAACCGAC CGCCCAGAGT AAGCGCTTTA CCCAGGCCGT TGCTTTGCCG
CAGGGCGGGG TTATAAAAGC CCGCGCCTTT GTCGATAACA TGAAAAAAGC GAGCAGCCCC
GTAACGGCCG AATTCGACAT TAGTTCGGCG AAATGGACGG TTGTGTCGAC CGGCGACGCA
GTACCCAAAA AAGAAGCTGT CCGGCTAATC GACGGCAACG CGGACTCGTT CTGGCAGCAA
CGTAAACAGG CCGAAAGTCC AACATCGGTG GTGCTGGATT TGGGCGAGGA GCTGCCGCTG
AAAGGCTTCA CGTACCTGCC GCGTCAGGAT GGGAAAAAGG CGGGTATCGT GTACCGATAT
GCCGTTTCCG TAAGCCAGGA TGGGAAAACA TGGTCGGCAC CGGTAAGTCA GGGAGCGTTC
AACAACATCA ATAATAACCC CGTTGGGCAA GCCGTCCGCT TCGACAAACC ACAAACGGCC
CGGTTCCTGA AGTTCGACGC CCTCGAAACA ACCGAGGCCA GCGACGCCAC CGTATCCATT
GCCGAACTGG GTGTACTGAC CCGCTGA
 
Protein sequence
MRRLTLLAAF VSFSLLSIAQ TPAPYGAVPS PRQLAWHKLK YYAFVHFNMN TFTNEEWGHG 
TETPDMFNPT QLDCRQWAKV AKEAGMEGIV ITAKHHDGFC LWPSKYTEHS VKNSKWRNGK
GDVLKDLSEA CKEYGLKFGV YLSPWDRNHP AYGTPEYNEV FKKTLQEVLT QYGDVFEVWF
DGANGEGPNG KKQVYDWPGF IETVRKYQPN AVIFSDAGPD IRWVGNEDGY AGETNWSTLN
RDKVYPAYPN YWELTLGHED GTHWVPTEVN CSIRPGWYYH ASEDNKEKSL EHLVDIYYSS
IGRNGNWLLN LPVDRRGLVH ENDVKQLMAL KAYTDKATHN LAGGKKITSN SVFSKASTFA
AGNVLDASRD TYWAAAEGAK QATLDIDLGK PTTLNRLLIE EYIALGQRVK KFSVAAWQKD
GYQTIASGTT IGNRRILRFP TVTTTKIRVS IDESKASPLI RHIEMYNAPE LIVTPVISRN
KEGMVTIVCP KTTDPVITYT TDGSEPTAQS KRFTQAVALP QGGVIKARAF VDNMKKASSP
VTAEFDISSA KWTVVSTGDA VPKKEAVRLI DGNADSFWQQ RKQAESPTSV VLDLGEELPL
KGFTYLPRQD GKKAGIVYRY AVSVSQDGKT WSAPVSQGAF NNINNNPVGQ AVRFDKPQTA
RFLKFDALET TEASDATVSI AELGVLTR