Gene Slin_4722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4722 
Symbol 
ID8728486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5749022 
End bp5752006 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content56% 
IMG OID 
ProductImmunoglobulin V-set domain protein 
Protein accessionYP_003389499 
Protein GI284039569 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.933088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACATT TTGTACGAAT GGGGACGGCT CCCCGTAATC CGTTTCTTTA CCTGCTGCTC 
TGGGCAGGCC TTTGGCTGCT CTCCGCTCCC GCGCTGTTTG CGCAAACTGG TCTGCAACAG
TCGCTGATTG CCAACCCCGA TAATGCCGGA ACTTACCAGG GCTACCAGGG CCAGCCGATT
GTGTTTAACA ACGCGCTGTA TGGCTTGTAT CTGAACGAGA GTGGGGTCTA TCAATTAGCC
AAATACAATG GCACGAGTTT GACCCTGATT GCCAATCCCG ATAATGCCGG AACTTACGAG
GGCTATGATA GCGGTCTGAT TGTGTTTAAA AACGCGCTGT ATGGCGTTTA TAGGAACGCG
AGTAGAGTCT ATCAATTAGT CAAATACAAC GGCACAAGTT CGACCTTAAT TGCCAACCCC
GATGAGGCCC CAACTTACCT GGGCTATTAT GGCCAGCCGA TTGTGTTTAA CGACGCGCTG
TATGGCAAGT ATATGAACAA GAGTGGGGCG TATCAATTAG TCAAATACAA TGGCACGAGT
TTGACCCTGA TTGCCAACCC CGATAATGCC GGAAATTACC AGGGCTACCA TGGCGATCCG
ATTGTGTTTA ACAACGCGCT GTATGGCCGG TATATGAACG CGAGTGGGGC CTATCAATTA
GCCAAATCCA ATGGCACGAG TTTGACCCTG ATTGCCAACC CCGATAATGC CGAATTTTAC
CAGGGCTATG TAGACGATCT AGACTATCCG ATTGTGTTTA ATAACGCGCT GTATAGCCAG
TATCTGAACG CGAGTGGGGC CTATCAATTA GCCAAATACG ATGGCACGAG TTCTACTCTG
ATTGCCAACC CCGATAATGC CGGAAGTTAC GAGGGCCACT CGATTGTGTT TAACAACACC
CTGTATGGCC AGTATCTGAA CGCGAGTGGG GTTATTCAAT TAGCCAAATA CAATGGCACG
AGTTCGACAC TGATTGCCAA CCCCGATAAT GCCGGAAGTT ACCAGCGCTC CCCGATTGTG
TTTAACAACG CCCTGTATGG CCAGTATCTG AACGCGAGTG GGGTCTATCA ATTAGCCAAA
TACGATGGCA CGAGTTCGAC CTTGATTGCC AACCCCGAAA ATGGCCCAAG TTACCGGGGC
TATATGAGCG ATCCGATTGT GTTTAACAAC GCCCTGTATG GCAAGTATAT GAACGCGAGT
GGGGTTATTC AATTAGTCAA ATACGATGGC ACGAGTTCGA CCCTGATTGC CAACCCCGAT
AATGCCAAAG GTTGCGATGG CCACTCGATT GTGTTTAACA ACGCCCTGTA TGGCAAGTAT
CTGAACAAGA GTGGGGTCTA TCAATTAGTC ACTGGGGTAC CCTGTGCATT GTCGCTGAGT
ATCAACCCCT CCTCGCTAAC GATAACAGCG GGTGGCTCGG TCACGCTTAC CGCTTCCGGA
GCTACGACCT ACACCTGGAG CAACGGCAGC ACGGCCAACC CGCTTATCGT CAGCAACGTC
ACCAGTGCCA CGGCCTTTTC AGTGACGGGC GTAACGGGTA CGTGTTCGGC CACGGCCACG
GCCAGCGTGA GCGTGGCCAC CATCACGGCG GGTACGACCT CGGGAACCAT CACGGCTTGC
GCGGGTACGG CATCGGCATT GCCCGCCGTG CAGCAATTCA ACGTTTCGGG CAGCACTCTT
TCAGGAAACA TCGTGGCTAG TGCCCCGCTC GGTTTCGAGC TTTCCACCAC TGCCAGCACT
GGCTATGCGG CCTCTCTGAC GCTCACTCAA TCGAGTGGCG TAGTGGCCAA CACCACCATC
TACGTGCGCT CGTCTGCCTC GGCCAGCGGG AACCTTTCGG GCAATGTGAG TCTGGCTTCG
AGCGGAGCGA CGACGCAAAA CGTAGCCGTG AGTGGAACGG TTACCCCCCT GGCCACCATT
ACGGCCCAGC CCGTGGCCAG TTCATCGGTG TGCGCGGGCA CGACGGTCAC CGTCTCGGTG
AGCACCAGTG GCCCGGTGAG CAGCTATCAG TGGTATAAAG GCGGCACCCT GCTCAGCGGC
CAAACCTCGG CTACGCTCAC CCTGACCAAC CTCAGCACCA CCGATGCGGG CAGTTATTCG
GTCGTGGTCA CGGGCAATTG CAACAGCCTC ACCTCAACCG CCTTTAGCCT GACAATCGTT
GCCCGGCCCG ACGCGCCCGC CCTGACCCCC GCCAGCTCTA GTCTGGCAGC GACTCTGACA
CCCCTCTCGC TGACGGGCTT TGCACTGGCA ACCACCGGCA ATAGCCTCCA CTTCTTCCAA
GCCGGAGGTA GTGAACTCAG CCCGCCCACC ATCAATATTA CCACTGCCGG GGTAATGAGC
TTTTGGGTCG GCCAGACCAG CAACGCCAGC GGCTGCAAGA GTTCACTCAC GCCATTGAGT
CTGACCATCA CGGCCACCCC CACCAGCCAG ACGGCTACCC CCACCAGCCA GACCGTTTGC
CGCAGCACCA ACGTTACCCT GAACGTCACC GTGGAGGGAA CCGCCTACCA GTGGTACAAA
AACGGTACTA CCCTAGCCAA CAAACTTACC GAACTCACCA GTGCCCAGCG CGGTACGACC
ACCGCCACCC TGACACTGGT CAATTTGCAA ACCACCGCCG ACTACTACTG CAAAATCACT
ACTCCCAACG GCGTTCAGAC TGTGGGGCCC CTGAAGGTGA GTGTCAACTT TGGCTGTTCG
GCCCGGCCTG CGGCCGAGGA AGCAGACTTG CAACTATTGG TACTGGTCAG GCCAAACCCT
ATCGTAGACG GCCACCTGCG GGCCCTGGTG AAGGGGGCTC AGGGGCAAGC CCTGAACGTA
GCCCTCTACA GTCTGCAAGG GGAGTTGGTG AACCAGCAGG TCTGGGACTC AGCCCCGGCC
GAGGTCAATC TAGATTGGGA TATCAGCCAG CGCAGCATGG GTGTGTTACT CTTGCGGGCC
CAGACCCCAA CTCAGCAGCA AACCATCAGG CTTATCCAAA ATTAA
 
Protein sequence
MEHFVRMGTA PRNPFLYLLL WAGLWLLSAP ALFAQTGLQQ SLIANPDNAG TYQGYQGQPI 
VFNNALYGLY LNESGVYQLA KYNGTSLTLI ANPDNAGTYE GYDSGLIVFK NALYGVYRNA
SRVYQLVKYN GTSSTLIANP DEAPTYLGYY GQPIVFNDAL YGKYMNKSGA YQLVKYNGTS
LTLIANPDNA GNYQGYHGDP IVFNNALYGR YMNASGAYQL AKSNGTSLTL IANPDNAEFY
QGYVDDLDYP IVFNNALYSQ YLNASGAYQL AKYDGTSSTL IANPDNAGSY EGHSIVFNNT
LYGQYLNASG VIQLAKYNGT SSTLIANPDN AGSYQRSPIV FNNALYGQYL NASGVYQLAK
YDGTSSTLIA NPENGPSYRG YMSDPIVFNN ALYGKYMNAS GVIQLVKYDG TSSTLIANPD
NAKGCDGHSI VFNNALYGKY LNKSGVYQLV TGVPCALSLS INPSSLTITA GGSVTLTASG
ATTYTWSNGS TANPLIVSNV TSATAFSVTG VTGTCSATAT ASVSVATITA GTTSGTITAC
AGTASALPAV QQFNVSGSTL SGNIVASAPL GFELSTTAST GYAASLTLTQ SSGVVANTTI
YVRSSASASG NLSGNVSLAS SGATTQNVAV SGTVTPLATI TAQPVASSSV CAGTTVTVSV
STSGPVSSYQ WYKGGTLLSG QTSATLTLTN LSTTDAGSYS VVVTGNCNSL TSTAFSLTIV
ARPDAPALTP ASSSLAATLT PLSLTGFALA TTGNSLHFFQ AGGSELSPPT INITTAGVMS
FWVGQTSNAS GCKSSLTPLS LTITATPTSQ TATPTSQTVC RSTNVTLNVT VEGTAYQWYK
NGTTLANKLT ELTSAQRGTT TATLTLVNLQ TTADYYCKIT TPNGVQTVGP LKVSVNFGCS
ARPAAEEADL QLLVLVRPNP IVDGHLRALV KGAQGQALNV ALYSLQGELV NQQVWDSAPA
EVNLDWDISQ RSMGVLLLRA QTPTQQQTIR LIQN