Gene Slin_6007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6007 
Symbol 
ID8729788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7283481 
End bp7286528 
Gene Length3048 bp 
Protein Length1015 aa 
Translation table11 
GC content55% 
IMG OID 
ProductEndonuclease/exonuclease/phosphatase 
Protein accessionYP_003390768 
Protein GI284040838 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00134837 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACG TTTTACAGGT GCTTTTAGCC GCTTCTTTGT GGCTAATGAC AGCTTCTGGC 
TTTGCCCAGA CAACGCTGGC CCAATGGAAT TTTGAAGGGG CACCCGGCTC GGTGACCGCT
GCTGCCGCAT CGACAATCAC GGCCGATAAT GCCGCTTTTG GATCGGGGGT AACCAGCGTC
ACGTTTGTAG CCGGTAATGG GGGAGGCCGG GCCTATACCG GAACGGGCTG GAATGTAACA
AGCCCGGACG TCAACAAGTA TATCCAGATT AGCCTCGGGC CGGTGGCCAG TTATACCATG
GCCCTGTCGC AGCTGAGTTT CGACGAGCAG CGGTCCGGTA CAGGACCAAC AACGTGGGTG
TTACGGTCGA GTCTGGACAA TTTTACTGCC GATATCAACA CGACGCCCAC ATCGACCACC
TCGCCTACGT CAAATGGCTC ATTTACATCG CGGGTGGTGT CGTTGAGTGC CAATACTGCC
TTTCAGAATT TAAGTACCCC CATCACCTTT CGGATTTATG GTTACGGAGC CAGCGGGTCG
GGCGGAACCT GGCGATTCGA TAACATTCAG GTGAGCGGCA CCACCGGCCC CAGCGATCCG
ACAGCACCGC TTCTGTCGGT TATACCGGGT TCGCTGCCCG GCTTCGCTAC GGTGCCCGGT
ACGCCTTCAA CGGCAAAATC GTATACGGTG TCCGGGGTCA ATCTTACGAG TGATCTGACC
ATAACTGCCC CAACGGGCTA TGAAGTCAGC AAAGACAACG GTACCAGTTA TGCTACTGTG
CAGACGTTGA CGCAGTCGGG CGGAATCATT GCGCCTACAG CCATCAGTGT CCGGCTTACG
GGGGCGGGCA ATGGTACTGT AAACGGCACA ATCACGAATG TGGCCGGTGC TGTTTCTAAA
AATGTGACGG TGAGTGGAAG CGTCGATGTG TCGAATGGAC CAGTGCCTAT CGCTACAGCC
CGGGGGCAGG TCGGTACAAC GGTTATCATT CAGGGGCGCG TAACGGTGAG CAGCCAGTTT
GGCGGCAAGC TGTTTTACAT ACAGGATGCT ACCGGCAGCA TTGCCGTTTA CGACCCTACC
ACAAGCTACG GAAATCAGGT ACAACTCGGC GATCTGGTGC AGGTGAGCGG ACCGGTGGCG
CTTTTTCAGG GGAAAAAAGA AATCAACGGG GTTGCGACCT TTCTGAAGGT CAATGATACA
AATCAGCCCG TTACCCCTCA GGTTATTACG GCTACGCAGC TCACCTCGGG TGCCTTCGAA
GGGCAGTTGG TAACGGTTCA GAATGCAACC ATTGGCGGCT CGGGGGCAAC GTTTCAGGGC
GGAGCCGCCG GGACGTATCC GCTCACGACC AGCGATGGGA CCGCCGAGTT GTTCGTAACG
AGTGCCTCCG ATCTGGTGGG GGCTACCAAA CCTACGGGTG CACTGACCGT TACGGGCATT
GCCGATCGTT ACATACCGAC CGATGCGTCG AAGAATGTCG TTCAGCTTAA CCCACGCGCC
ATCTTCGATA TTCCCGGTTC GGAAGTGCCC CCGCCACCAC CAACCATAAC GACCTGCCCG
GCCAATCGGA CCATCACCGA TAATGACCAG ACATTGAGTG TAGTAAGCTG GAACCTCGAA
TGGTACGGTT TTGATGGCGG TTCGTACACT TGTACCAATG GCTCCAGAAC GTACGCCGAT
AATGGCCCGA CAAATGAGAC GCTTCAGGCC CAGAATGTTC GCTCTGTAAT GGACGCGTTC
AACGCTGACA TTTATATTTT GCAGGAAGTG AGCGATAAGA ACCTGCTTGT GACCAATACC
CCGGCCGGCT ACGCGCTGAG CTGTTCCGAT CAATACACGT CGTACTTCTT TCAGGACTTG
TGCGATGCCA ACGGCAATCC GCAGGGGTTC AACCCAACGA GCCTCAACCA GAAAGTCTGC
GTAATGTACA AAACGAGCGT TGTTTCGATG ATTCCCGCCG AAAGCAAACC GTTGCTCACT
GACAAGTACA GCTATACCAC AACACCACGC AGCGATGCCT GGGCGTCGGG TCGGTTACCA
TACCTGTTTG TGGCGAACGT AACCGTCGAT GGGCAAACCC GGAAACTATA CATCGTTGAT
ATTCATGCCA AATCCGGATC AGCACAAGCC GATTACAACC GCCGGAAACA GGACATCATT
GACCTGAAAG CTGAACTGGA TGCCAATTAT GGTAATGTAA ACCTCATCAT GGCTGGTGAC
TACAACGACG ATGTCGATCA GTCCATTGCC GCCGGTAATC CTTCGTCGTA CGCCAACTTC
GTGAGCGACC CCAACTATAC CGTTATTTCC AGTGAGTTGA GCAGCAGCAA CTGTAACACC
GATGCGAACT TTACCGACGC CATCGACCAC ATTACGGTAT CCAACGAGCT GGCGTCGTCT
TACGTAGCCG GTTCGGTGGC ATCGGTTCGG CCAGCCGTTG TCAATTACGC CCTGACAACC
TCCGACCACT ATCCAACCTT CGCCCGTTTC ACACTAGCCA GTCCTTTACC GGTGCGCCTG
ACTTCCTTTG CTGCAAAGCC AGTTGGCGAG ACGGTGGGTA TATCCTGGAC GACGGCCAAC
GAAACGAACA GTGCTTATTT CGAGGTAGAG CGTAGTGTCG ATGCACGTGA GTTCGCGTCT
ATTGGCCGGG TAGCGGCTGC GGGAGATGCC CAATCAATAA AAACCTATGG CCTTGTCGAT
CAACACCCCC TGAGCGGTAC CAATTACTAC CGGTTGAAAC AGGTTGACCT GGATGGCAAG
ACGGCTTACT CGACAATTGT ATCTGTCGTG ATGGATAATC TAACGCCCGC TATGGAACTG
CTGGGGAATC CGGTCGACAA TCAGGCGATT CGGGTGGCTG TTCGTAACCT GCCGAACGCT
GTCTATCGTT TAACGACCCT TACCGGACGT GAGCTTCCTG TGCAGGGCCA AACTCAGGCG
GATGGGTCGA TGCTGCTGAC AACCGCACAG GCTCTCAGCC CCGGTGTGTA CCTGCTCCGG
GCTGATTCGG GAACTACCCG CATTATGCGG AAAGTGGTAA TACGGTAA
 
Protein sequence
MKHVLQVLLA ASLWLMTASG FAQTTLAQWN FEGAPGSVTA AAASTITADN AAFGSGVTSV 
TFVAGNGGGR AYTGTGWNVT SPDVNKYIQI SLGPVASYTM ALSQLSFDEQ RSGTGPTTWV
LRSSLDNFTA DINTTPTSTT SPTSNGSFTS RVVSLSANTA FQNLSTPITF RIYGYGASGS
GGTWRFDNIQ VSGTTGPSDP TAPLLSVIPG SLPGFATVPG TPSTAKSYTV SGVNLTSDLT
ITAPTGYEVS KDNGTSYATV QTLTQSGGII APTAISVRLT GAGNGTVNGT ITNVAGAVSK
NVTVSGSVDV SNGPVPIATA RGQVGTTVII QGRVTVSSQF GGKLFYIQDA TGSIAVYDPT
TSYGNQVQLG DLVQVSGPVA LFQGKKEING VATFLKVNDT NQPVTPQVIT ATQLTSGAFE
GQLVTVQNAT IGGSGATFQG GAAGTYPLTT SDGTAELFVT SASDLVGATK PTGALTVTGI
ADRYIPTDAS KNVVQLNPRA IFDIPGSEVP PPPPTITTCP ANRTITDNDQ TLSVVSWNLE
WYGFDGGSYT CTNGSRTYAD NGPTNETLQA QNVRSVMDAF NADIYILQEV SDKNLLVTNT
PAGYALSCSD QYTSYFFQDL CDANGNPQGF NPTSLNQKVC VMYKTSVVSM IPAESKPLLT
DKYSYTTTPR SDAWASGRLP YLFVANVTVD GQTRKLYIVD IHAKSGSAQA DYNRRKQDII
DLKAELDANY GNVNLIMAGD YNDDVDQSIA AGNPSSYANF VSDPNYTVIS SELSSSNCNT
DANFTDAIDH ITVSNELASS YVAGSVASVR PAVVNYALTT SDHYPTFARF TLASPLPVRL
TSFAAKPVGE TVGISWTTAN ETNSAYFEVE RSVDAREFAS IGRVAAAGDA QSIKTYGLVD
QHPLSGTNYY RLKQVDLDGK TAYSTIVSVV MDNLTPAMEL LGNPVDNQAI RVAVRNLPNA
VYRLTTLTGR ELPVQGQTQA DGSMLLTTAQ ALSPGVYLLR ADSGTTRIMR KVVIR