Gene Slin_1920 details


Gene Information

Locus tag: Slin_1920
Symbol:
ID: 8725657
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2325686
End bp: 2327812
Gene Length: 2127 bp
Protein Length: 708 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: ComEC/Rec2-related protein
Protein accession: YP_003386764
Protein GI: 284036834
COG category:
COG ID:
TIGRFAM ID:
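
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single untranslated stop codon. Neither assumption is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2325686
end_bp = 2327812
reported_gene_length = 2127      # bp
reported_protein_length = 708    # aa

# Assumption: Start/End are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)        # True
print(gene_length % 3 == 0)                       # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the reported gene length, protein length, and coordinates agree with one another.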


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.112953
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.839642
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGGA ATCCGTTTGT TCGCTACGCG GCTGCCCTAA TTACTGGAAT TGTCCTGTAT 
GTGTACTTGC CCGACTGGAC AGTTATTCCG CTGGTGGCCC TGCTGGCAGG ACTTGGGCTG
CTCCTTTGGG GAATTCGTCG AACAAGTGGC AAACTCGTTA AGCCGATCCA AACGGCATCC
GGCCTGGGTG GTCTGCTACT TCTGTTAGCG TTGGGCTGGT GTATTTCGTA TCAGCGTACG
GCCCGGAACC AATCCGATAA CCTGATTCAT CTCACGGATA CACTTCGAGC CTATGAAGGT
GTGATCATGG CTCAGCCGGA AGAACGGGCC AGAACATTCC GCGTGGAGCT GGCCATTCGC
CGGGGAAAAC GGACTAGTCC AGTAGGAGAC CAATGGCAAC CGCTAAGCGG CCGGGTTATC
GTGTATCTGG ATAAGGCGGG CCAACCCATG CCAAACTACG GAGAGGTCTG GTTAGTAGCG
GGCTCGCCAA GGCCCATCGA CCCACCCTTG AATCCTGGCG AGTTCGACTA TAAACAGTAC
CTCAGCTACC GGAATATCTA CCACCAGCAG TATCTGCGGC CCTACGAGCG AACTATACTG
TCGATTGATC CCCCTAGTCG TATTACGGAC CTTGCCACCC GTGTGAACCG CTGGGCCGAC
AGTGTTTTCA CCCATCAGGT TGGCAACCGG GCTGAATACG GGATAGTCAA CGCCATGATT
CTAGGCGTAA GGGATGACCT GGATACCGAA CTCTACCGGG CCTATGCCGC AGCAGGAGCT
GTGCATGTAC TGTCGGTGTC GGGGCTTCAT GTGGGGATTC TTTTCCTTGT TTTAACCTTT
TTGCTCAGTT TCCTGATTAA ACGACCTCGG GGTAAACTCC TGATGGCTTT TTTACAATTA
ACGATCCTGT GGTTCTACGC ATTGATTACA GGGTTTTCGC CCCCGGTGCT GCGATCAGCC
GCCATGTTCT CGTTACTTAT TATCGCCAAT GCATCCGGTC GTCAACAGCA GTTTATCAAC
TCACTGGCCG CTTCAGCTTT TTTTATTCTC TGTTTCGACC CCTACGCCTT GTTTTCGGCT
GGATTTCAGC TGTCGTATCT GGCCGTGGGT GGAATCGGTA CGTGGCAGTC TCCGCTTTAT
CAGTCGATCA CGTTTCGGTA CAAACTGGCG GATAAAATCT GGGAATTGAC GGCTGTTGCG
CTGGTTGCCC AGCTGATTAC CTTTCCATTA GGCGTTTTTT ATTTTCATCA GTTCCCCACG
TACTTCCTGC TGGCCAATCC GATTGTGATT GTCATGTCGA ACATTCTCTT GCCGCTGGCC
ATGACGACAT TAGCGTTTAG CGGGATTCCT TACGTAAATG AGTTGTTGGG CTGGCTATTG
GAGAAAACGG CCTGGTTTCT CAATTATGCC GTTACGCAAA CCGGTTCCTT GCCCGGTGCC
GCCTGGGATG GACTTTGGAT AAGTCAACTG GCTATGGTAC TGATCTATGT CGTTCTATTT
TGTGGTGTAG CCTTATTGAT TACACGAGAC AGGGTCTATT TATGGGCAAC CAGTCTGGCT
TCTCTTGTCG TTGCCGGTCT AACCATATGG AACGATTTTG AGCAGACCAG GCAACAACGG
CTAGCGGTTC ATTTCCTGCC TCATCGCACG GCGGTCAGCC TTACCGATGG GCATCAGAGT
ACTGTACTAA CCGATCTGGA TGCGAACGAC ACCCGTTCCT TTGACTTCTA CCTGAAAAAT
ACGTTTGGGC AATGGGGCGT TTCGGATCTA ACCATCATTA AGGCCAGTCA AACAAGCAGC
GTTACAGATT CGATACCAAA ACGTCTTGCC TGTTACCAGG ACCGGTCATA TACGTTGTGG
GTCTGGCATG GTATAACAAT CCTACTGGTC AATCAATTGA GCGAATCGTA TTACTGGCGA
TTACCGGCCG TCGTTGACTA CCTCATTATC CGCCGGAATG CCCTGCATGC CTGGAATCAG
CTTGATGGGC GGGTGGTTGC CCGGCACATT ATCTTCGATG ATTCGAATAA GACGCCCCTA
ACGGATAAGT TACTGGCCGA CGCAAAAGAG CTGGGAATCG CCTGTTATTC GGTGCGTCAG
ATGGGAGCTT ACGTAGCTGA TTTATAA
 
Protein sequence
MKGNPFVRYA AALITGIVLY VYLPDWTVIP LVALLAGLGL LLWGIRRTSG KLVKPIQTAS 
GLGGLLLLLA LGWCISYQRT ARNQSDNLIH LTDTLRAYEG VIMAQPEERA RTFRVELAIR
RGKRTSPVGD QWQPLSGRVI VYLDKAGQPM PNYGEVWLVA GSPRPIDPPL NPGEFDYKQY
LSYRNIYHQQ YLRPYERTIL SIDPPSRITD LATRVNRWAD SVFTHQVGNR AEYGIVNAMI
LGVRDDLDTE LYRAYAAAGA VHVLSVSGLH VGILFLVLTF LLSFLIKRPR GKLLMAFLQL
TILWFYALIT GFSPPVLRSA AMFSLLIIAN ASGRQQQFIN SLAASAFFIL CFDPYALFSA
GFQLSYLAVG GIGTWQSPLY QSITFRYKLA DKIWELTAVA LVAQLITFPL GVFYFHQFPT
YFLLANPIVI VMSNILLPLA MTTLAFSGIP YVNELLGWLL EKTAWFLNYA VTQTGSLPGA
AWDGLWISQL AMVLIYVVLF CGVALLITRD RVYLWATSLA SLVVAGLTIW NDFEQTRQQR
LAVHFLPHRT AVSLTDGHQS TVLTDLDAND TRSFDFYLKN TFGQWGVSDL TIIKASQTSS
VTDSIPKRLA CYQDRSYTLW VWHGITILLV NQLSESYYWR LPAVVDYLII RRNALHAWNQ
LDGRVVARHI IFDDSNKTPL TDKLLADAKE LGIACYSVRQ MGAYVADL
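
The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method used by the source database: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard one. Translation table 11 differs from the standard table only in its set of start codons, and this gene starts with ATG, so the standard table is used here as a stand-in.

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the blanks and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied with its original spacing.
first_line = "ATGAAAGGGA ATCCGTTTGT TCGCTACGCG GCTGCCCTAA TTACTGGAAT TGTCCTGTAT"
print(translate(first_line))  # MKGNPFVRYAAALITGIVLY
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the complete gene sequence to `gc_content` gives a value that can be compared with the reported GC content field.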