Gene Slin_6061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_6061 
Symbol 
ID8729842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp7358064 
End bp7359494 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content53% 
IMG OID 
ProductCurli production assembly/transport component CsgG 
Protein accessionYP_003390822 
Protein GI284040892 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAC TTTTTTTCAA AGCTATCCAG CTCCTTATGC TGACTGGCTT GGTGGGCATT 
TTGTCGGGGT GTACCGCTTT TTTCTTTCAA CCCACAAAAG CCCGGCGGGC TCGGTTAGGG
GAAGAAACAC CCGTTACGGC AGACTTACAC CGACTGCCGG TTGCCAAAGA AAAAATCATC
ACTGCCGTTT ACAAGTTCCG GGACTTAACC GGTCAGTACA AGCAGATTGA AGCGGGTTCG
ACCTTCTCCA CCGCTGTTAC CCAGGGAACC ACCAACATAC TGCTTAAAGC GCTCGAAGAG
AGTGGCTGGT TTTTACCCAT CGAACGCGAG AATGTCAGCA ATCTGCTCAA TGAGCGGAAA
ATTGTACGCT CCAGTGTAGC CCAGTATAAA GAAGGCGAAA ATCTACCTCC TTTGCTGTTT
GCCGGTATTA TTCTGGAAGG GGGCGTTGTT TCGTACGATG CCAATATCAT CACCGGTGGT
GGCGGACTTC AGTACTTCAG CGCCGGTGGC TCAACACAGT ACCGGCAGGA CCGGGTAACG
GTTTACCTGC GGGCTGTATC CACAAAGTCG GGGAAAATTC TCAAAACGAT TTATACCTCC
AAAACCATTC TGTCGCAAAC GGTCAACGCC AGCCTGTTTC GGTATGTGAC CTTCAAACGG
CTGCTGGAAA CCGAAACGGG CCTGACCACG ACCGAACCGG GTCAGCTGGC CGTTACGGAA
GCCATCGAAA AGGCCGTTCA GGGGCTCATT ATCGAGGGTG TGCGGGATGG ATTGTGGCTC
CCGGCCGATA ATCAGGTTGC CGCCATGCAG GCGGTTGTCA AGGACTACGA GAAAGAGAAA
ACCGTAATGA GCGAAACGGA TGTGTACGGC ATACGTCCCG AAGTAGCCCC GCCTTTTCTA
AGCGTACACA CCTATGCCGG TGCGATGCGA TATTATGGCG ACTACGCTCG CCGTACCATC
AAAGGAAGCT ACGGGGCTTC TGTCGATTTC CACATTACGC CCGCCTTTGG CTTGCAGGTG
AACGGTGCTA CCGGCGTTCT GGCCAGCGAA GGGGCTTTTT CGACGAACAT CACCTCCCTG
GAAGGTAATC TGATTCTTCG TTTAACGCCC TATCAGCGCT GGACATCCCT CTTATTTGCA
GGTGCCGGTG CTGTTTCGCA GAGTGGCTCA TCACCCTTCC AATGGCGGGG AGCCAGTTAC
TTACAGGCTC AGGGAGGAGT TGGCGTGCAG TTTTCGCCAA GTAAGGTAAT CGGATTCCGC
TCTACTTTAT CTTATAACCA GCCTTTTACG GATGCGCTGG ACAGTAAAGT GGCCGGAACC
CGTAATGACT ACTACCTACG GGGAACGCTG GGGGTGGTTT TCCATATCGG CCGCTTCTCG
CCACCAAAAG TAAAGCCACT CACTCCGCAG CAGCCGGTAA TGAACAAATA A
 
Protein sequence
MKVLFFKAIQ LLMLTGLVGI LSGCTAFFFQ PTKARRARLG EETPVTADLH RLPVAKEKII 
TAVYKFRDLT GQYKQIEAGS TFSTAVTQGT TNILLKALEE SGWFLPIERE NVSNLLNERK
IVRSSVAQYK EGENLPPLLF AGIILEGGVV SYDANIITGG GGLQYFSAGG STQYRQDRVT
VYLRAVSTKS GKILKTIYTS KTILSQTVNA SLFRYVTFKR LLETETGLTT TEPGQLAVTE
AIEKAVQGLI IEGVRDGLWL PADNQVAAMQ AVVKDYEKEK TVMSETDVYG IRPEVAPPFL
SVHTYAGAMR YYGDYARRTI KGSYGASVDF HITPAFGLQV NGATGVLASE GAFSTNITSL
EGNLILRLTP YQRWTSLLFA GAGAVSQSGS SPFQWRGASY LQAQGGVGVQ FSPSKVIGFR
STLSYNQPFT DALDSKVAGT RNDYYLRGTL GVVFHIGRFS PPKVKPLTPQ QPVMNK