Gene Slin_5040 details


Gene Information

Locus tag: Slin_5040
Symbol:
ID: 8728805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6143961
End bp: 6146876
Gene Length: 2916 bp
Protein Length: 971 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: excinuclease ABC, A subunit
Protein accession: YP_003389815
Protein GI: 284039885
COG category:
COG ID:
TIGRFAM ID:
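
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 6143961
end_bp = 6146876

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2916 971
```

Under these assumptions the computed values agree with the reported Gene Length (2916 bp) and Protein Length (971 aa).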


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAAC CGAACGTGAC AGAAGAAAAA ACGACCGAGC GTCAGCCCGG ACTTACTGAT 
ATTGACCTGA CAGGCTACGA CCAGATTGAA GTGCTGGGAG CCCGTGAACA TAATCTCAAA
AACATCGACG TCGTTATTCC CCGCAACAAA CTGGTGGTGG TAACGGGTAT TAGCGGGAGC
GGCAAATCGT CGCTGGCCTT CGATACCATT TATGCCGAAG GGCAACGTCG ATACATGGAG
AGCTTTTCGG CCTACGCCCG CTCGTTTATC GGCGATATGG AACGGCCCGA CGTCGACAAG
ATCAACGGCC TGAGTCCGGT GATTTCCATC GAGCAGAAAA CAACGTCTAA AAACCCCCGC
TCAACCGTCG GCACCACGAC CGAGATTTAC GATTTTCTGC GTCTGCTCTA CGCCCGTGCG
GGTGAAGCGT ATTCGTATGT AACGGGCCGG AAGATGGAGC GGCAGTCGCA GGACCAGATT
ATCGACACGA TTTTGGGGCA GTACGAGGGG CAAAAAATAA CCCTGCTGGC ACCCATCATC
CGAAGTCGGA AAGGTCATTA CCGCGAACTG TTCGTGCAGA TTGCCAAAAC GGGCTATACC
AAAGTGCGGG TGGATGGCGT GGTGCAGGAC ATTGTGCCGA AGATGCAGCT CGACCGCTAC
AAAATCCACG ACATCGAAAT CGTCATTGAC CGGCTTGTTC CTAAAACGGA AGATCGGTAC
CGCCTGAGTC AGTCGATCCA GACGGCCATG AAGCAGGGCA AAGGCGCTAT GCAGATGCTG
GATGCCGATG GAAAGCTGGT GTATTTCTCG CAGAACCTGA TGGACCCCGA ATCGGGCATC
AGCTACGACG AACCCTCGCC AAACTCATTC TCGTTCAACT CGCCCTACGG AGCTTGTCCG
GTTTGTAATG GACTGGGCGT TATTGAAGAA ATCACGGAAG AATCGGTCAT TCCCGATAAG
TCGCTGAGTA TTAGTCGGGG AGCCATTGCC CCGCTGGGCG AATACCGCGA GTTGTGGATA
TTTAAGGAGA TTGAAGCGAT TCTGAAAAAA TACAAGCTTA ACCTCACTAC GCCGGTTGTC
AAGTTTCCGG ATGATCTGCT GCATGCACTC ATGTATGGCA CGGAGGAGGA AGCCACCGTG
CCCTCGAAAA AATACGTGGG CGAAGATTAC TACAGCTTTA AGTTCGAGGG TATTGTCAAC
TTTCTGAAAC GTCAGCAGGA GAACAGCACC GATAAGATAC AGGAGTGGCT GAAGGATTTT
ATGGTGATAA AGTCCTGCCC CGAATGCCAC GGTGCACGAT TGAAGAAAGA ATCACTGTTC
TTTAAGATCG ACGAAAAAAA CATCTCCGAG CTGGCCCGCA TGGACATTTC GGAACTGACG
GCCTGGTTCG ATGGGGTAGA GGACCGGATG ACCAACCGCC AGAATGTGAT CGGGAAGGAA
ATCCTGAAAG AGATTCGCAA ACGCATCGGC TTCCTGCTCG ATATTGGTCT GGACTACCTG
ACGCTCGACC GGCCGTTGCG GACGCTGTCG GGTGGCGAAG CGCAGCGTAT CCGGCTGGCG
ACCCAGATCG GGACTCAGTT GGTGGGCGTG CTCTACATCA TGGATGAGCC GAGTATCGGT
CTGCACCAGC GCGACAACGT AAAACTGATT GATTCGCTCA AGAACCTGCG CGATCTGGGT
AACACCGTTC TGGTGGTAGA ACACGACAAG GACATGATGC TCGAATCGGA CTTTATCCTC
GACATTGGTC CCGGTGCGGG GCGGCATGGC GGGCAAGTTG TCAATCAGGG AACTCCCGAC
GAGTTTCTCC AAAACAAGTA CATTGGCGTT GCGGGCAGCA GCACCACCGC CGATTACCTG
AGTGGCCGAC GCGCCATTGA GGTGCCAAAG GAGCGCCGGA AAGGAAATGG CAAGTTTTTG
GTTATCAAGA ACGCAACGGG TCATAATCTG AAGAACGTAA CCCTACGCTT GCCGCTGGGC
CGGATGGTTA CCATTACGGG CGTGTCGGGC AGCGGTAAGT CGTCGCTGAT TCACGAAACT
CTGTTCCCGA TACTGAACAA GCACTTTTTC CGCTCAAAGC GGGAGCCATT ACCGTTCAAA
ACGGTGGAAG GGCTGGAGCA TCTGGACAAA GTGATCGAGG TCGACCAGTC GCCCATCGGC
CGGACGCCCC GCTCGAATCC GGCTACCTAC ACGGGCATGT TTTCGGAAAT CAGAACCCTA
TTTGCCGAAT TGCCCGAAGC TAAAATTCGG GGTTACAAAC CCGGTCGGTT CTCGTTCAAC
GTGAAGGGTG GCCGTTGTGA AGATTGCGAA GGCGCGGGTA TGAAGAAGAT CGAAATGGAG
TTTCTGCCCG ATGTTCACGT CATGTGCGAA ACCTGCAAAG GCAAACGCTT CAACCGCGAA
ACGCTGGAAG TTCGCTTCAA AGGAAAATCC ATTGCTGACG TGCTCGACAT GACCGTGGAG
CAGGCGCTGG ATTTCTTCGC CAGTCAGCCC AAAATTCTCC GGAAAGTAAC GACCCTGAAC
GACGTTGGCC TGGGTTATAT TACCCTCGGC CAGCACGCTA CAACGCTCTC GGGTGGCGAA
GCGCAGCGGG TGAAACTAGC CGAAGAGCTA TCGAAGAAAG ATACCGGCAA AACGCTGTAT
ATCCTGGATG AACCTACCAC CGGTCTGCAC TTTCAGGACA TTTCCCACTT GCTCGACGTG
CTGAATAAAC TCGCTGACAA AGGGAACACC GTTCTGATTA TCGAGCACAA TCTGGACGTC
ATCAAAGTAT CGGATCACCT GATCGACCTT GGTCCCGAGG GCGGTAATAA AGGTGGCAAC
ATTATCGCCG AAGGCCCACC TGAAAAGGTA GCTGAGGTCA AGGGAAGCTA TACCGGCAAG
TTTCTGAAAA TGGAGTTGAC GGGTGAAAAG GCGTAA
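
The gene sequence above is displayed in blocks of 10 bases, 60 per line, so it needs to be joined into one string before any analysis. The sketch below is one possible way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last displayed lines are used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence as displayed above.
first_line = "ATGAGCCAAC CGAACGTGAC AGAAGAAAAA ACGACCGAGC GTCAGCCCGG ACTTACTGAT"
last_line = "TTTCTGAAAA TGGAGTTGAC GGGTGAAAAG GCGTAA"

seq = clean_sequence(first_line + "\n" + last_line)
print(len(seq), seq[:3], seq[-3:])  # 96 ATG TAA
```

Applied to all 49 displayed lines, the same cleaning step should give a string of 48 × 60 + 36 = 2916 bases, matching the reported gene length, and `gc_fraction` on that full string can be compared against the reported GC content.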
 
Protein sequence
MSQPNVTEEK TTERQPGLTD IDLTGYDQIE VLGAREHNLK NIDVVIPRNK LVVVTGISGS 
GKSSLAFDTI YAEGQRRYME SFSAYARSFI GDMERPDVDK INGLSPVISI EQKTTSKNPR
STVGTTTEIY DFLRLLYARA GEAYSYVTGR KMERQSQDQI IDTILGQYEG QKITLLAPII
RSRKGHYREL FVQIAKTGYT KVRVDGVVQD IVPKMQLDRY KIHDIEIVID RLVPKTEDRY
RLSQSIQTAM KQGKGAMQML DADGKLVYFS QNLMDPESGI SYDEPSPNSF SFNSPYGACP
VCNGLGVIEE ITEESVIPDK SLSISRGAIA PLGEYRELWI FKEIEAILKK YKLNLTTPVV
KFPDDLLHAL MYGTEEEATV PSKKYVGEDY YSFKFEGIVN FLKRQQENST DKIQEWLKDF
MVIKSCPECH GARLKKESLF FKIDEKNISE LARMDISELT AWFDGVEDRM TNRQNVIGKE
ILKEIRKRIG FLLDIGLDYL TLDRPLRTLS GGEAQRIRLA TQIGTQLVGV LYIMDEPSIG
LHQRDNVKLI DSLKNLRDLG NTVLVVEHDK DMMLESDFIL DIGPGAGRHG GQVVNQGTPD
EFLQNKYIGV AGSSTTADYL SGRRAIEVPK ERRKGNGKFL VIKNATGHNL KNVTLRLPLG
RMVTITGVSG SGKSSLIHET LFPILNKHFF RSKREPLPFK TVEGLEHLDK VIEVDQSPIG
RTPRSNPATY TGMFSEIRTL FAELPEAKIR GYKPGRFSFN VKGGRCEDCE GAGMKKIEME
FLPDVHVMCE TCKGKRFNRE TLEVRFKGKS IADVLDMTVE QALDFFASQP KILRKVTTLN
DVGLGYITLG QHATTLSGGE AQRVKLAEEL SKKDTGKTLY ILDEPTTGLH FQDISHLLDV
LNKLADKGNT VLIIEHNLDV IKVSDHLIDL GPEGGNKGGN IIAEGPPEKV AEVKGSYTGK
FLKMELTGEK A
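
The record lists translation table 11, so the protein sequence should correspond to the gene sequence translated codon by codon. As an illustrative sketch, the code below builds a codon lookup using the standard amino-acid assignments in the usual T, C, A, G base ordering (tables 1 and 11 assign the same amino acids and differ in permitted start codons) and translates the first 30 bases of the gene. The function name `translate` is hypothetical, and this is not the pipeline that produced the record.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above, with spacing removed.
first_30 = "ATGAGCCAACCGAACGTGACAGAAGAAAAA"
print(translate(first_30))  # MSQPNVTEEK
```

The output matches the first block of the protein sequence, MSQPNVTEEK.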