Gene Slin_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2001 
Symbol 
ID8725739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2416184 
End bp2417686 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content45% 
IMG OID 
ProductTPR repeat-containing protein 
Protein accessionYP_003386845 
Protein GI284036915 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0783856 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0172593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTATAC AGACAGAATC CGTAATTTCG TTCTTTAAAA GAAAACACAA CCCCAGTTTA 
CACATGAAAC AAGTTTTTGT AATCGCCGTC GCGTGTCTCC TCTCACTAGG TGTACGCGCC
CAGCAGGCAG CCGGAGACGC GCTTGCTATG GAAGCATTTA AAAAAGAAAA GGAAAAGAGT
GATAAGGATA TTACGGATGC CAAAGCGGCT GCTAAAGCCA GCACCTGGAT GGATCGGGCC
AAAACGTATC AGAATATCGC TTCGCAGGGA CAGATGAAAA TCGATTCGGC TGCCGCTACG
ATTGCTTACG ATGCTTTCAA GAAAGTAGTT GAATTGGATA AGGACAAAAA AGGCGGTCCT
GGTAAATTAG CCAAAGAAGC AGAAGAAGCA TTAAAGGCTC CCCTTATGGC CACTGCTTTC
ATGCAGCAGG GCGTTGCCAA GTTCCAGTCG AAGAATTATG CTGACGCGAT GAAGTCGATG
GCCATAGCGG GCGACATTAA TCCAAAAGAT ACGCTGGCAC CGCTTTATAC GGCTATTGCT
GCCCAGCAGA TCCGGGATAA CGCAACGGCT AAAACTCAAC TGGAAAAGTA TATTACCAGT
GGAGGGAAGG ATGCAACCAT TTATGGTTCG TTAGCAATGC TATATCGGGC CGATAATGAA
GTTGATAAAG CCCTGGCTGC TTTGGATAAA GGTATCGCTT TGGCGCCGAC TAATAAAGAT
CTGGCCAATG AGAAGATCAA TATTATGCTG TCGACAAACC GAATGGATGA AGCCATCACG
GGTATGAAAG CGATGGTTGA GAAAGATCCG AACAACGTGC AGAATCTGGT AAACCTTTCG
ATTGTTTATA ATAACATTGC TAACAAGTCT TCTGAGGAGA TCCGCAAGTT AGAAGGAGAC
AGCAAAAAGG GTGGTAACAC CGCCAAGCAA TTAGCCGATG CCAAGAGTAT TATCGACGCT
TATAACGGTG AGATTGCCCG CTTACAGGCA TCGATCAAAA AGTCGCCTAA ACCCGAATTA
AAGCGTCAGC TAACAGATGT ACAGAAGCGT TTGGCAGAGC AGAAGACAGA AGTTGCCAAG
CTTGAAGCTG AGGCTAAAGA AGCGGCAGCA AATGCCGGTG CTGTCGCAGA CGGGGCGAAG
CGGTTAGGCG AACTCAAGCA GGCACAGGCC GAAAATAAGA AACAGGAGAA AATGTACCTG
GATAAAGCCT TGGCCATTGA TCCTAACAAT TACGATGCAA ATTTCAATAC AGCAGTTTTC
CTTTTCAACG AGGCTGTAGA AATGAAGCGT GGTGTAGACC GGATGGATAT GGCTGAGTAC
AACAAAAATG GTAAGGAACT GGATGGTAAA GTTTGCGGTA AGTTCAAGCA ATCGTTGCCA
TACTTCACCA AAGCTAAATC AATTAAAGAT GAAGCCGATG TGAACGAAAA CCTGACTAAT
CTTCAAAACA TTTTGAAACA GTACGAAGAG AAGAAGATTG TATGTATCGA ACCTGGTAAA
TAA
 
Protein sequence
MLIQTESVIS FFKRKHNPSL HMKQVFVIAV ACLLSLGVRA QQAAGDALAM EAFKKEKEKS 
DKDITDAKAA AKASTWMDRA KTYQNIASQG QMKIDSAAAT IAYDAFKKVV ELDKDKKGGP
GKLAKEAEEA LKAPLMATAF MQQGVAKFQS KNYADAMKSM AIAGDINPKD TLAPLYTAIA
AQQIRDNATA KTQLEKYITS GGKDATIYGS LAMLYRADNE VDKALAALDK GIALAPTNKD
LANEKINIML STNRMDEAIT GMKAMVEKDP NNVQNLVNLS IVYNNIANKS SEEIRKLEGD
SKKGGNTAKQ LADAKSIIDA YNGEIARLQA SIKKSPKPEL KRQLTDVQKR LAEQKTEVAK
LEAEAKEAAA NAGAVADGAK RLGELKQAQA ENKKQEKMYL DKALAIDPNN YDANFNTAVF
LFNEAVEMKR GVDRMDMAEY NKNGKELDGK VCGKFKQSLP YFTKAKSIKD EADVNENLTN
LQNILKQYEE KKIVCIEPGK