Gene Slin_4396 details


Gene Information

Locus tag: Slin_4396
Symbol:
ID: 8728156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5330953
End bp: 5332761
Gene Length: 1809 bp
Protein Length: 602 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_003389176
Protein GI: 284039246
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCACG ACACGCCGAC CAACATGAAT TCAGCATATT CTACATATCG AAGCCATCGG 
CTACTCATCG GATTAGCAAG TAGTGTAGGT CTGCTCCTTT GCCTGGCCGA TTTACCTGCC
GTAGCACAGC GCAAACGCGA TAAAGCCGAT ACCGCTCTGG CTAAACCCGG CTCTACCTCG
ACCACTACCG TTCGGATGGA GGCCGAAACG CAGTTTACGG ATGGTATCCG GTATTTGATG
ACGGATGAAC CGTCTAAAGC CATTACTCAG TTTGCCAAAG TGCTGCAAAA AGACCCCAAC
AATGCAGCCG CTCAATACTC CACAGCCAGC GCGTACCTCA AAAGTGGAAA AGTGACCGAA
GCCCTTCCAT TTGCGGCTAA AGCGCATGCC CTGGATGTCG ACAATAAATT TTATTCCCTG
CTGCTGGCAG AACTTTACGT CAAGCAGAAG CGGTACGGCG AGGCCGAAGA CCTCTACGAA
GCCTTGTTGA AAAAAGGACC CGAAAACGCA GAATATGGCG TTGAACTGGC GGCTATCTAC
CTCTTCAATG AAAAGCCCGA CAAGGCGCTG GAAGCGTATA ATAAAGTAGA GCGTGAGCTG
GGTTTGAATG AAGAAATTAC CCGGCAGAAG CAACGCATCT ACCTCAAGCA GAACAAAATT
GAGAAGGCCA TTGAAGAAGC TGAGAAATTG GTGGCATCGG AACCCTCCGA CCCCGACTAC
CTGCTCGAAG GCGCCGAACT GCTCATTGCT AACGACCGGC CCGATCAAGC CATCGGCTGG
ATTGACCGGG CCCTTAAGTT AAGCACCGAT TTGCCCCAGG CGCATGTGCT GCTGGCCGAT
ATCTACCGTA AAAAGGGGGA TATGGACCGG GTTAGTAAAG AACTGAACCA GGTACTGGCC
AATCCAAACC TCGAAGCCGG TCTGAAAGCC CGAATCCTGT CGAGCTACAT GGGTATGACC
GGTTCCAATA CCGCAGCGCA GCAGGATGCC CTGGCTATGG CGCAGAACCT GGCCAAAACA
TCGCCCAACG ACCCCAAAAC ACAGGTAATG CTAGCCGATC TGCTTATGCA GCAGGGCAAA
AAAGCCGAAG CCCGCGATAC GTACGCCAAA GCTGCCCGTC TGGACGGCTC TATTTATGAA
GTTTGGGGAG CCCTGCTACA GCTCGATAAC GAACTGAATC AGGTTGATAG CCTCCTCATC
CACTCCGAAA AGGCGCTGGA AGTCTTTCCA ACACAGGGGC TGTTCTGGTA CTCCAACGGG
TCGGCCAATC TATACAAGCG CCGGTATCAG CAGGCCGTTG ACGCCCTCGA AGAAAGTCAG
AAGCTGCTGG CTGCTAGTTC GAGCAACGAG CTAAAAAAGG GAATCAGTGC CCAGTTGGGT
GATGCGTACA ACGGTCTTGG CGATTATGCC AAATCAAACG AATCGTACGA AGCCGTCCTG
AAAGTTGACC CCCTGAACGA CTACGTTCTG AACAATTACA GTTATTTCCT GTCTTTACGG
AAGGAAAACC TGCCTCGTGC TTTACAACTT GCCCAGAAAC TCGTTGAGCG CAACCCAACG
AATGCGACCT ATCTGGACAC CTACGCCTGG GTGCTTTATG TCTCGAAAGA TTACGCAAAA
GCAAAGCAGT ATCTCGAAAA AGCACTGGCC GATCCAGCAA ACGTGAGCGG AACCATTATT
GAACATTATG GCGATGCGCT TTACCAGTTG GGCCAGGCCG ACAAGGCACT TGAACAGTGG
AAGAAAGCGA AGATGAAAGG TGGTGCCAGC CCCGATATAG ATAAGAAAAT AACCTCTGGC
AAAATGTAA
 
Protein sequence
MHHDTPTNMN SAYSTYRSHR LLIGLASSVG LLLCLADLPA VAQRKRDKAD TALAKPGSTS 
TTTVRMEAET QFTDGIRYLM TDEPSKAITQ FAKVLQKDPN NAAAQYSTAS AYLKSGKVTE
ALPFAAKAHA LDVDNKFYSL LLAELYVKQK RYGEAEDLYE ALLKKGPENA EYGVELAAIY
LFNEKPDKAL EAYNKVEREL GLNEEITRQK QRIYLKQNKI EKAIEEAEKL VASEPSDPDY
LLEGAELLIA NDRPDQAIGW IDRALKLSTD LPQAHVLLAD IYRKKGDMDR VSKELNQVLA
NPNLEAGLKA RILSSYMGMT GSNTAAQQDA LAMAQNLAKT SPNDPKTQVM LADLLMQQGK
KAEARDTYAK AARLDGSIYE VWGALLQLDN ELNQVDSLLI HSEKALEVFP TQGLFWYSNG
SANLYKRRYQ QAVDALEESQ KLLAASSSNE LKKGISAQLG DAYNGLGDYA KSNESYEAVL
KVDPLNDYVL NNYSYFLSLR KENLPRALQL AQKLVERNPT NATYLDTYAW VLYVSKDYAK
AKQYLEKALA DPANVSGTII EHYGDALYQL GQADKALEQW KKAKMKGGAS PDIDKKITSG
KM