Gene Slin_2220 details

Gene Information

Locus tag: Slin_2220
Symbol:
ID: 8725958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2690481
End bp: 2691773
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: TPR repeat-containing protein
Protein accession: YP_003387041
Protein GI: 284037111
COG category:
COG ID:
TIGRFAM ID:
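
The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (the variable names are illustrative and not part of the record):

```python
# Values copied from the record.
start_bp = 2690481
end_bp = 2691773

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1293 430
```

Both values match the Gene Length and Protein Length fields.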


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.13078
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATA GTGAAGAGCG AAGGAGTGCA AGACTGAGAG AGCGTTTTAC CCCAGCTGGA 
ATTCGCTCGT TCAGCCTTTC ATTCGTTCGC TCTTTCGTTT TATCGCTGGC CGTTTTAAGC
TTAGTAGTTG CCTGTAGCAG CGATGACCGG CAGTCGGCTC GCATACCGGA CTACCCAAAG
CCGGGCGACA GCAGCCGGAT TGAAGGCGCT TTGCGGGCAC TGACACTGGC CATCAACCAG
TCGTCACCGG CATCCGCCTA CGCCAAGCGG GCTGTCATCC TGCTGGCTAT GGGCAGGTTG
AATGAGGCAT TGGCCGACAT CGACGAGGCC ATCAGCCGAA ATGATAATTT AGGGTCGTAT
TATCTGACCC GCGCACAGGT GTTAAGGGCT TTACAACAGC CAGCCAAAGC ACTTCAGAAT
GCGCAGCGGG CCGAGATTCT GGGTGTCGAC ACCCCCGAGT TATACACATT ACAGGGCGAT
TTATTGCAAC GGCAGAATCA GTTCGATAAA GCAAAGCTCT ACATAGCCAA AGCGCTGCAA
ATGGCGCCCT ACGATGGCGA AGCTTACTTT TATAAAGGGC TGATGGCCGC TCGGCAGGGG
GACACAATAC AAGCTCTGGC CTTGTATCAG CATTCGCTCC GCCTGAAACC CCGCTATTTG
GAGACGTACA ACCAACTGGC GTCCATCTAC CGCACCACGG GTGATCTCAA CTCTGCTTTG
GTTTATAATG GACAGGCGTT ACGTTACTTC CCGAACAATC CCCGGCTATA TTACGGTCGC
GGGCTTATTT ACCATACGGA GGGTAAGCTC GATAGTGCCA TCGTCTATTA TCAGCAAACG
ATGAAGGTGC AGCCGGGGTA TTATCAAGCC TATTTTCAGA TGGGGCTGAT CAATCAGAAG
TTTCGGAACT ACTACGCAGC CCTAAACAAT TACCAGCGGG TGCAGGAGTT ACGGCCTCAG
TTCCCACGGA TTGATACCTA CATTGGATAT TGCCACGAGC AGATGGGTCA ATACGATCTG
GCCATAGCTG CCTATACCAA AGCAACCCAG CTAAACATTG CCGACAGGCA GGCCGCTGCG
GGTTTGTGGC GTTCTCAACG GAGGCAGTAT GCGCAAAACT CGTATAACTC CTTATTTTTG
TCGGATACGG CTGGCAAATC ATCGAGTCAG AATCGAAACC GAACTGAAAT TGATACCACG
CGTGTGCGTA TTTCAACTAT ACAGCCAAAA GCCCGCGTGA CAACCAATGC CGGCGATTCG
CTGCAGCGAA CCGTTAAACC AATTAACAAA TAA
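
The GC content field (52% in this record) can in principle be recomputed from the gene sequence. The sketch below is a hypothetical helper, not the database's own method. It assumes the sequence is pasted as plain text with the 10-base spacing shown above and that only A, C, G and T occur.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)

# Toy input: 6 of the 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # prints: 75.0
```

Passing the full gene sequence to `gc_percent` and rounding the result gives a value to compare with the reported figure.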
 
Protein sequence
MKNSEERRSA RLRERFTPAG IRSFSLSFVR SFVLSLAVLS LVVACSSDDR QSARIPDYPK 
PGDSSRIEGA LRALTLAINQ SSPASAYAKR AVILLAMGRL NEALADIDEA ISRNDNLGSY
YLTRAQVLRA LQQPAKALQN AQRAEILGVD TPELYTLQGD LLQRQNQFDK AKLYIAKALQ
MAPYDGEAYF YKGLMAARQG DTIQALALYQ HSLRLKPRYL ETYNQLASIY RTTGDLNSAL
VYNGQALRYF PNNPRLYYGR GLIYHTEGKL DSAIVYYQQT MKVQPGYYQA YFQMGLINQK
FRNYYAALNN YQRVQELRPQ FPRIDTYIGY CHEQMGQYDL AIAAYTKATQ LNIADRQAAA
GLWRSQRRQY AQNSYNSLFL SDTAGKSSSQ NRNRTEIDTT RVRISTIQPK ARVTTNAGDS
LQRTVKPINK
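
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the pipeline used to build this record. It assumes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs from the standard code in its permitted start codons, which does not matter here because the gene starts with ATG). `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence, as printed in this record.
first_line = "ATGAAAAATA GTGAAGAGCG AAGGAGTGCA AGACTGAGAG AGCGTTTTAC CCCAGCTGGA"
print(translate(first_line))  # prints: MKNSEERRSARLRERFTPAG
```

The output matches the first 20 residues of the protein sequence. Translating the last line of the gene sequence in the same way gives LQRTVKPINK, followed by the TAA stop codon.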