Gene Slin_4237 details


Gene Information

Locus tag: Slin_4237
Symbol:
ID: 8727996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5110702
End bp: 5112438
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003389020
Protein GI: 284039090
COG category:
COG ID:
TIGRFAM ID:
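
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names `cds_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_len: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if cds_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_len // 3 - 1


# Values copied from the record above.
gene_len = cds_length(5110702, 5112438)
print(gene_len)                  # 1737
print(protein_length(gene_len))  # 578
```

Both results agree with the Gene Length (1737 bp) and Protein Length (578 aa) fields listed in the record.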


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.46126
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAATA CTAAAAACAT TGTCAGGACA CTGGCACTGG CTGGAGTTAT AAGCATAGCC 
AGCTCGTGTA ACTATACCGA TCTGGATATC AATACAGATC CCAACAAACC GGCAACTGGT
TCGCTGGGTC TGCTGTTGCC GGTAGCTGAA AATGCCGCTC GTGACGCGTT TACGAGTGTC
AATAGCGGTG CGATGAGTTT TGCGGGTTTG TGGAATTCCA GTAATGCAAC CACCTCCTAC
AACCTGAGCA ATACTGACTT CCAGACCACT TGGAATGATG CGTATCGAAA TATGCAGAAC
ATGGAGGAGA TGCTGCGGGC TACGGAAGAC GGTAAGAATC CCCGCTATCG GGGAATTGCC
CTAGTGCTCA AGGCCTATGC GATGGGTAAC TATGTGGATA TGTTTGGCGA TATGCCGTAC
ACAGAAGCCT GGAAAGGTAA TGCAGCCCAA CAGAATACAT CCCCTGTATT CGATAAGGAC
GCAGCTATTT ACGAGGATCT GATTAAACTG TGCGATCAGG CCGTTGTGGA ACTGGCTAAA
CCCCAGCCCG TTGCCGTAAT CAATGACTAC ATGGGTGGAG GTAATGCAAC GACCTGGACG
CGCATTGCCA GAACGGTTAA GCTGCGTTTG TTACTTAACT CGCGCAAAGG CCGTACCAAT
GGTAATGCTG AATTGAAGGC GGCTTTCGAC GCAGGTGGAT TTATTTCAAC ACCAGCGCAA
AACTGGTCCT ACCTGTATTC CAAGCAGATT TCACCCGAGC GCAACACCCA CCCCTGGTTT
ATTACCTACA CGGGTACGTC TGATCCTAAC TACATCAACC ACCAGTTGAT GGGCGAAATG
ATTCTGAATA AAGATCCACG TCTGCCTTTC TACTTCTACC GGCAAACATC ACGGATTCTG
GACCAAAACA ACCCAACCGA CCGGGGTACC ACGCCGTTTG GTGGATCGTA CCTGCCGCTC
CGGGCGTCAT TCCTCGATGA ATACAAGAGC GTATTTGGTA TCACCGGAGC TATTCCTACT
GCTGATCTGG CTTACATCGC CGGTTACTTC GGACGCGATC GGGCCGATGT ATCGGGTGCT
GCTGCTGATG GTCCACTACG GACGGCCCCC GGCACCTATC CGGCGGGTGG CGTTTATTCT
GATCGGAGCG TACCTGCTGT AGCGCTGACG GGTCAGGCAT CGCTGAACTT TGGTGGCGAT
GGTATGTGGC CACTGATCCA TAGCTGGAAC ACGAAATATT ATCAGGTGGA AGCTATTCTG
GACGGAACTG GTGTAACAGG TGACCCCAGA GCTCTTTTTG AAGCTGCTAT GCGTGAGCAG
ATTGCCGTGG TTGTGGCTCA AGGCCTTAAA TCAGATCCCA CCCGTGCCAA AGCACCGGCC
CAGACCGAAG TGGATGCCTA TGTAAAAGCA TGGCTGGATT TGTATGATGC GGCTACATCC
GCCCAGTCAA AATTGAACGT GGTCGCCAAG CAAATCTGGT TCTGCTCCTG GGGACAGGGT
ATGGATATCT GGAACTTACA GCGCCGAACC GGCTACCCCA TTCAAAGCCA GTTCAAACAG
TTCTCGGTAG GGATTCAGGC TCCAATTTCA AAGCCACCCC GTCAGTATGC GCTTCGCTTA
CCCTATCCGC AGTCTGAAGG TGCCCTGAAT CCGAATGCAG CTAAATACGT TGCCGATGTG
ATTTTCGACC GGGATCCAAT TTTCTGGGAC AAGGTAAAAG TTAAGTGGGA GTACTAG
 
Protein sequence
MINTKNIVRT LALAGVISIA SSCNYTDLDI NTDPNKPATG SLGLLLPVAE NAARDAFTSV 
NSGAMSFAGL WNSSNATTSY NLSNTDFQTT WNDAYRNMQN MEEMLRATED GKNPRYRGIA
LVLKAYAMGN YVDMFGDMPY TEAWKGNAAQ QNTSPVFDKD AAIYEDLIKL CDQAVVELAK
PQPVAVINDY MGGGNATTWT RIARTVKLRL LLNSRKGRTN GNAELKAAFD AGGFISTPAQ
NWSYLYSKQI SPERNTHPWF ITYTGTSDPN YINHQLMGEM ILNKDPRLPF YFYRQTSRIL
DQNNPTDRGT TPFGGSYLPL RASFLDEYKS VFGITGAIPT ADLAYIAGYF GRDRADVSGA
AADGPLRTAP GTYPAGGVYS DRSVPAVALT GQASLNFGGD GMWPLIHSWN TKYYQVEAIL
DGTGVTGDPR ALFEAAMREQ IAVVVAQGLK SDPTRAKAPA QTEVDAYVKA WLDLYDAATS
AQSKLNVVAK QIWFCSWGQG MDIWNLQRRT GYPIQSQFKQ FSVGIQAPIS KPPRQYALRL
PYPQSEGALN PNAAKYVADV IFDRDPIFWD KVKVKWEY
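
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows, using only the Python standard library. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons, a detail this sketch ignores because the gene begins with ATG). The `translate` helper is hypothetical and written for illustration.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGATCAATA CTAAAAACAT TGTCAGGACA"))  # MINTKNIVRT
# Last 27 bases of the gene sequence above, ending in the TAG stop codon.
print(translate("AAGGTAAAAG TTAAGTGGGA GTACTAG"))     # KVKVKWEY
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.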