Gene Slin_3942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3942 
Symbol 
ID8727700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4724045 
End bp4725736 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content53% 
IMG OID 
ProductTPR repeat-containing protein 
Protein accessionYP_003388731 
Protein GI284038801 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.171262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.567153 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCTA TTTTTTGGGT ATTGATCCCG TCACTTTTTT TAACTGCCTT TTTTGTTCTC 
AAGCCCTACT CCGTTTCTGA AACGGGCACA AGTATCGCCA TGTGTGGGTC TGATGCCAGT
GGCCGTATCA GCCAGCTGGA AAATGGGAAG TTTATGGTTC CCCTGCCGGG TTGGGGAGAT
CATTCGTATG TGATTTCCAC CCGGTCGGAC AGTGCGCAGT TTTACTTCAA TCAGGGTCTG
ACGATGTACT ACAGCTACCA CATGAAGGAA GCGATGGCTT CGTTTAAAGA GGTGGCCCGT
CTTGACCCTA CGTGTCCGAT GGCCTGGTGG GGTCAGGCAT TGGCCGGAGG CCCTTATTAC
AATGCGGCCC ATACATACAC CGTTCCCGCC GATATGCCCA CTATTCTGGC TCGCATGAAC
GAACTGGCAT CGAATGCCAC CGCGAAAGAG AAAAGGCTGA TTCAGGTGAT GAACACCCGA
TACGCGACAG TTGGGGCCGG GGAAGACCGA AAAAACCTCA ATGAAGCCTA CGCATCGGCA
ACGAAAGAGT TGATTACCGA ATTTGATGAC CCGGATATCA AAATGCTGTA TGTAGATGCC
ATTATGCTGA TTCATGCCTG GGATTTCTGG ACACCCGATG GAAAGCCCAA AGCCTGGACA
CAGGAAGTAG TCGACCTGAC CGGGGCTGTA CTGAAGCAAT ACCCGAATCA TCCGGCAGCG
CTGCATTACC AGATCCACCT AACTGAAGCG TCCCGACAGC CCGAAGTGGC CTTAACCAGT
GCCGATAAGC TCAAAACACT GCTGCCCGGC GTTGCGCATA TGGTTCACAT GGCGAGCCAC
GAATATCAGC GAAACGGTCT TTTTGAGCAG GGTGTACAGG TCAACGACAA AGCCGATGCC
AATTTGTTGA TCTACGATTC GCTGGCGGCT CATCTGAATT TGGTAAAGCA TTCGCCCCAT
TATTTTGCCG TTCAGACCTA CTGCGCGCTG TCGGGAGGGA TGTACGAAGT TGGCTTGAAA
GACGCCCTGC GCTGCCGAAA ATCCGTCTCG CCGGTAGCCG GGAATACCTA CGATCAATAC
CTGTATATGC TACCCTCGCT GACGCTGGTG AGGTTGGGTA AATGGAACGA GATCTTGGCG
GCACCGAAGC CGCAGAACGA CTGGGCCTAT GCCATGCTGC TGGATCATTT TTCACGAGGC
ATGGCGCTGG TAGCCCTCGG AAAAACGGCG GAAGCTCAGC ACGAGTTGAC ACGGCTTCGG
GAGCGGTTGA GTGACCCGAT TCTTGAAAAA CGACGTATTC CTTTCAATGC GCCCTTGCCC
GTGGCCCGCA TTGCCGAACA TATTCTGGAC GCATCCCTGC TTTTCTCTCG AAAGGCGCAT
GATCCGGCTT TTGCGGCCCT CGATCAGGCT ATCAACCTGG AAGATCAGTT GATTTATACC
GAACCCAGTG ACTGGCCGCT GCCAGCCCGG CAGTTTTTAG GCGCTTATCT GCTTCAACTG
AAGAAAGCGA AGGAAGCTGA AGTCGTGTAT CGTGAGGATT TGGCGCATCA TCCGGGCAAT
GGCTGGTCGT TGGTTGGCCT TCATAAAGCG CTTGCCCTCC AGGGGAAACG GGCCGAACTG
GCGAGAATCG AGGCTGGCTA CAAAACCGCT TTTTCAAAGG CCGAGCAAAT GCCGACATCG
TCTATTTATT GA
 
Protein sequence
MKAIFWVLIP SLFLTAFFVL KPYSVSETGT SIAMCGSDAS GRISQLENGK FMVPLPGWGD 
HSYVISTRSD SAQFYFNQGL TMYYSYHMKE AMASFKEVAR LDPTCPMAWW GQALAGGPYY
NAAHTYTVPA DMPTILARMN ELASNATAKE KRLIQVMNTR YATVGAGEDR KNLNEAYASA
TKELITEFDD PDIKMLYVDA IMLIHAWDFW TPDGKPKAWT QEVVDLTGAV LKQYPNHPAA
LHYQIHLTEA SRQPEVALTS ADKLKTLLPG VAHMVHMASH EYQRNGLFEQ GVQVNDKADA
NLLIYDSLAA HLNLVKHSPH YFAVQTYCAL SGGMYEVGLK DALRCRKSVS PVAGNTYDQY
LYMLPSLTLV RLGKWNEILA APKPQNDWAY AMLLDHFSRG MALVALGKTA EAQHELTRLR
ERLSDPILEK RRIPFNAPLP VARIAEHILD ASLLFSRKAH DPAFAALDQA INLEDQLIYT
EPSDWPLPAR QFLGAYLLQL KKAKEAEVVY REDLAHHPGN GWSLVGLHKA LALQGKRAEL
ARIEAGYKTA FSKAEQMPTS SIY