Gene Slin_1991 details


Gene Information

Locus tag: Slin_1991
Symbol:
ID: 8725729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2403930
End bp: 2405033
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_003386835
Protein GI: 284036905
COG category:
COG ID:
TIGRFAM ID:
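
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch spells out. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function name `lengths_from_coordinates` is hypothetical and not part of the source database.

```python
def lengths_from_coordinates(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
    codons = gene_length // 3             # complete codons in the CDS
    protein_length = codons - 1           # the stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the record above.
gene_length, protein_length = lengths_from_coordinates(2403930, 2405033)
print(gene_length, protein_length)  # 1104 367
```

Under these assumptions the result agrees with the reported Gene Length (1104 bp) and Protein Length (367 aa).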


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.310287
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0147414
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTATT TTCCGATAAA ACCATATTCT TTCGCAAAGG TAACCAACGC AATGGCCCAC 
TCACTATTCC GGGTACATAT CATTCGTCAG GTACTCCTGC TGATTTTTAT AGCCATCGTC
TTCATTCGCT GCGGCAATAA CACGACCGAT GAGGCTGCTC AGTTTTTCCT GAAGGGGAAT
GTTCAGTTGC AGAAACGTGA GTACAAAGAA GCAATCCGGT TCTACTCAGA AGCCATCGCC
AAAAAATCAG ATTTTGCGGA CGCCTACAAT AACCGGGGGT TAGCCAAGTT TCGGGACGAC
GACCGGGAGG GGGCGTTGGC CGACTATACC CGGGCTGTTG AACTGGACCC TGATTTTGGT
ACGGCGTACT TTAATCGGGC CGAGGTTCTC CTTGAAACCG GTGATGCGGC CGGTAGTGTG
TCGGACTTGA TGCGGATCAA TAAACAATAC CAGGATTCTA CCTTTTATCA AACGCGTTTG
GGCGACGTCT ATGTACGACT GGGGAAGCAG GCCGAGGCTC AGGCAGCTTA TGACCGCGCT
TTGCAACTCA ACCCCGATAA TGTGGAGGCT CTAACCAATC GGGGGGCTTT GTTGTATAGC
CAGAAGGCCT ATGACCAGGC CGGTGAGGAC ATACAGCGGG CTCTTCGGCT CAATCCAAAG
CAAGATGCTG CCTTGAACAA CCAGAGTTTA CTGCTCGCGC GTGTCGGTAA TTTTGCCGAA
GCGCTCGTCT ATGTAGAACG TGCACTGGCT TTACAACCCC GACAGCCGTA TTACCTGAAC
AACAAAGCGT ATTTATTGCT GAAACTAAAC CGGGCTTCCG AAGCACTTCC GGTGGTGCAG
GAGTCTCTGC AACGCGATGA CCGGAATGCC TGGGCTCATC AAACCCTCGG GCTGTATTAC
CTGAGTCAGA AACAGGCGGA CAAGGCACTT ACCGAATTTC GGCAGACCGA AAAACTGGAT
GCGTCCGTAG ATCAGGTCTA TTATTATATC GGTCTAGCGG AGCAGGCCCT CAACCAGCAG
CAGGCCGCCT GCGAAGCCTG GCGACTGGGC GAATTGGCCG GGGATGAACA GGCCAGAAAA
ATCCGGGCTC AGCAATGTAA GTAG
 
Protein sequence
MNYFPIKPYS FAKVTNAMAH SLFRVHIIRQ VLLLIFIAIV FIRCGNNTTD EAAQFFLKGN 
VQLQKREYKE AIRFYSEAIA KKSDFADAYN NRGLAKFRDD DREGALADYT RAVELDPDFG
TAYFNRAEVL LETGDAAGSV SDLMRINKQY QDSTFYQTRL GDVYVRLGKQ AEAQAAYDRA
LQLNPDNVEA LTNRGALLYS QKAYDQAGED IQRALRLNPK QDAALNNQSL LLARVGNFAE
ALVYVERALA LQPRQPYYLN NKAYLLLKLN RASEALPVVQ ESLQRDDRNA WAHQTLGLYY
LSQKQADKAL TEFRQTEKLD ASVDQVYYYI GLAEQALNQQ QAACEAWRLG ELAGDEQARK
IRAQQCK
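
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence so it can be compared with the protein sequence. It is an illustration, not the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons (table 11 differs in the alternative start codons it allows, which this sketch does not handle).

```python
from itertools import product

# Standard codon assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
prefix = clean("ATGAATTATT TTCCGATAAA ACCATATTCT")
print(translate(prefix))  # MNYFPIKPYS
```

Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` allows the reported GC content and protein sequence to be checked against the nucleotide data.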