Gene Slin_2013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2013 
Symbol 
ID8725751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2428036 
End bp2429829 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content51% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003386857 
Protein GI284036927 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.292522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGGAC GATTTCTTTT ACTGACCCGG ACAGTTCTTG TGTGCTGCGG ATTACTGTGC 
TCGCAGCTTG TATGGGCACA ACTTACATCC GATGCGCAGA AACGCTATAA AGCAGCTCTT
GAACTGGTAA GGACGGGCGA TTACGAACGA GCCAAGTCTG ATCTTAACGT GCTGATTCAG
CAGCGCGGCC CGCTGGCTCC CTATGCCGCC TATCACTATG CGATTGCTGC GTTCCGGCAG
CGAAAGTACC CGCAGTCCCG GGCCATGCTC AAGCAACTGA TGGAGCAATA TCCGGATTGG
CAGAAGATGG ATGATGCTAA CTACCTTTTT GCGGCTAATG GCATGGAACT GGGCCAATAC
GAGGAAGCAT TAACAGCCCT GCAGTCCATC ACGGATGCCG AACTCCGCAC CGACGTTACA
AAGCTTGAGC AGAACTTTAT CCCGCGTATT ACTGACCTGA CCCGGCTAAA GGCTTTGAGT
CAGTCGTTTC CCGCTGACCG GATTATCGGG CTGGCGCTAA TCGACCTGAT CCAACGGACA
GCAACCGATA AGGACGACCT CGAATTGTCT GACCGGTTGA CCAACCGGTT TGGTGTTCCG
CCCGTAACGT CGAGTCAGCC AGCAGGTACG ACATCTCAGG GGGGCGGCTC TCGTCCGGTT
ACCCCTATTT CGCCGAATGG ACGAAATTCA CGGACGAAGG GATATTATAA TGTCGCCGTG
ATGTTCCCTT TTCGGGTTGA CGAGTTTAAT TCGGATAAAC GGTTGAGGTC CAATCAGTAC
GTTTATGATC TCTACAATGG TATTAAGCTC GCAAAAGCTA AGTTGCAGGA AGAGGGGATT
ACGGTCAACC TGTTTGCTTA CGACCTGGAT AATGATGCCA ACAAAGCCCT TGAGCTGGTT
AATAGCCCGG CCTTTGCTCA AACGGACCTG ATCATAGGCC CGCTGTATGT GGAGCCTAAT
CGGATTGCGC TGGCGTACGC AAATCAGCAT AATATTCTAC TGCTCAATCC TATAGCGACC
AGCAGTGAGT TGATCGTTGA TCAGCCGATG TCTTTTCTGG CCCAGCCTTC CATGAACCAA
CAGGCGCGTA AAGTGGCTGA TTTGGTGCGT AGTTTAAATA CCACCCGACG GGCTGCCATC
TTTTTTGGGG CTACCCGGAA AGATTCGTTA CTCGCGGCTT CGTATCAGGC TGAACTCAAA
CGGCAGAACT ACCAGATTAT TGATTTTCGA AAAGTAAGTG GATCGGCACA GCAGATGGCA
GATGCTATGC AACTGTCCGG TACGGCAACG GCTACCCGGT CCGGCAATGC AATTTCTTCG
CAGTCCGGCG GTTCGTCCGT TGGGCATGTG TTCTTTTCCA GTAGCAATGA AGATGATGGC
GTTCGAATGC TCGATGCACT CAGCCGCCGA CGCGTCACGG TACCTTTGAT TTCAACGGCT
TCTGCCTTTG ATCTGTATAA AGTACCCGCT TCGACCTTTA CCCGGCGGGA ACTGTATTTG
TTATATCCCG ATTTTATTGA CAAAAGCCGG GAGCCGGTTA CGGCGTTTGA GGAAGAGTAC
CTTTCCAAAC GAAACACTAT TCCGTCGGTT TACGCGAGTG AGGGGTACGA CATGATGCTG
TTCTTCGGCC GTCAGTTAGC GAAAAATGGC CTTCAGCTGC GTGATCGAAG CACACTTATC
TCCGATACTG ACGATTACCT ACTTTCGGGT TTTGACTATA CACAAAGTAA CGACAATCAA
ATAGTACCAA TCGTAAAATA CGAAGACGGT CGGTTTGTGA AAATTAATGA GTGA
 
Protein sequence
MNGRFLLLTR TVLVCCGLLC SQLVWAQLTS DAQKRYKAAL ELVRTGDYER AKSDLNVLIQ 
QRGPLAPYAA YHYAIAAFRQ RKYPQSRAML KQLMEQYPDW QKMDDANYLF AANGMELGQY
EEALTALQSI TDAELRTDVT KLEQNFIPRI TDLTRLKALS QSFPADRIIG LALIDLIQRT
ATDKDDLELS DRLTNRFGVP PVTSSQPAGT TSQGGGSRPV TPISPNGRNS RTKGYYNVAV
MFPFRVDEFN SDKRLRSNQY VYDLYNGIKL AKAKLQEEGI TVNLFAYDLD NDANKALELV
NSPAFAQTDL IIGPLYVEPN RIALAYANQH NILLLNPIAT SSELIVDQPM SFLAQPSMNQ
QARKVADLVR SLNTTRRAAI FFGATRKDSL LAASYQAELK RQNYQIIDFR KVSGSAQQMA
DAMQLSGTAT ATRSGNAISS QSGGSSVGHV FFSSSNEDDG VRMLDALSRR RVTVPLISTA
SAFDLYKVPA STFTRRELYL LYPDFIDKSR EPVTAFEEEY LSKRNTIPSV YASEGYDMML
FFGRQLAKNG LQLRDRSTLI SDTDDYLLSG FDYTQSNDNQ IVPIVKYEDG RFVKINE