Gene Slin_1165 details

Gene Information

Locus tag: Slin_1165
Symbol:
ID: 8724898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1418091
End bp: 1419590
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003386015
Protein GI: 284036085
COG category:
COG ID:
TIGRFAM ID:
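
The coordinate and length fields above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1418091
end_bp = 1419590

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1500, matches "Gene Length"
print(protein_length)  # 499, matches "Protein Length"
```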


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.465388
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTCA GAAAACTTTC GATCGGGGTA TTGGCGGCTT TGATGACCTT TTCGTCCGGT 
TGCCAGAAGA ATTACCTCGA CATCAATAAT AACCCCAACC AGGTAACAGC CGCTACGCCG
GTACTGGTAT TGCCATCTGC CCTTAGTTCG TCAGGGGCGT ACTTCTCGAC CAGCTTCACC
TTCCTGAACC TCTGGATGGG CTACTGGAAC TGGAGCGGTA ACTACTCCAT TGCTACGTCG
GACAAAAACT ACCAGTTTAC CTCCAATTTT GGTACGGGCA TCTGGGATAA TGGGTATATC
ATCCTAAAAA ACTACGATTA CACCGAAACC CAGGCCGCTA CGCTAAAGCA ACCTCTGGTA
CAGGGAATTG CCAAAATCAT GAAGGCACTG CACTTTCAGA TTCTGGTGGA TACGTACGGA
AATATTCCCT ATACGCAGTC GCTACAGGGA TTAGCGTCCT CGCAACCAGC CTACGATAAC
GCTACGGCGG TTTATGAGGA TCTCTTCAAA CAAATCGATG CAGCCCTGAC CTTGTTCAGC
GATGCGGATA AACTGGCGGC TCAGGGCGGT ACGGTGCTTA ATCCGGGAAC CAACGACATT
ATGTTCAAAG GGGACATTGC CAAATGGCGT CGGTTTGCCA ACACCCTGAA ATTACGGATG
CTGCTGCGTC AGTCGGAGAA GGCCGACCGG CAGGCATTTA TTCAAACACA GCTGGCAACG
ATTAAAGCAT CGGGCTATGG CTTCCTGGGA AGTGGCGAAA ATGCGGCCGT CAACCCCGGC
TATGCGAATT CGGCAGGTCA GCAAAGCCCG TTTTACGGCA CCTACGGCTA TCAGGTGAAC
GGCCAGCCAA TTGAAGCCTA TAACATTAAC CGGGGCAACA AGTACGCCAT TGATTTCTAT
CAATCGACTA ACGATCCCCG GTTAGCCCGG TTCTACGCTC CGGTTGGTGG GTCAGGGACT
ACCTTCAACG GTACGTTTTT CGGTACCATC GCGCCAGTGG CAAACAACCA GACATCGGGC
ATTGGTCCCG GTTTACTGAG CGATGTCAAT CAGCCCGCTT ATATTCTGAC CTCGCACGAA
GCTCTTTTCC TACAGGCCGA AGCCGCCCAA CGGGGCTGGA TCGAAGGTGA CCCGAAAACG
CTCTATCAGC AGGCGATCAC CGAATCATTT GTCAATGTAG GCCTTACAGC GGCCAATGCA
GCAGCGTATT ATGCACAGGC AACGGCAAAT GTGGGTTGGG ATGCTTCGAC GAATAAAATT
CAGCTCATTA TTACGCAGAA GTGGGCCTCG CTCAATGGAT GGTCACCGTT TGAGGCCTGG
TCGGACTACC GCCGACTCGG GGTTCCGAAC GTGCCTATTT CGCAGGACCC CAGCACATCC
GTCAAACAGA TACCAGTCCG ACTGTTTTAC CCACAGAGCG AATACAACTT CAATAGCGGC
ATGGTGCGGC AGCAGGGCGA TATTAACCAG TTTACATCGA AGATATTCTG GGTAAAATAA
 
Protein sequence
MNLRKLSIGV LAALMTFSSG CQKNYLDINN NPNQVTAATP VLVLPSALSS SGAYFSTSFT 
FLNLWMGYWN WSGNYSIATS DKNYQFTSNF GTGIWDNGYI ILKNYDYTET QAATLKQPLV
QGIAKIMKAL HFQILVDTYG NIPYTQSLQG LASSQPAYDN ATAVYEDLFK QIDAALTLFS
DADKLAAQGG TVLNPGTNDI MFKGDIAKWR RFANTLKLRM LLRQSEKADR QAFIQTQLAT
IKASGYGFLG SGENAAVNPG YANSAGQQSP FYGTYGYQVN GQPIEAYNIN RGNKYAIDFY
QSTNDPRLAR FYAPVGGSGT TFNGTFFGTI APVANNQTSG IGPGLLSDVN QPAYILTSHE
ALFLQAEAAQ RGWIEGDPKT LYQQAITESF VNVGLTAANA AAYYAQATAN VGWDASTNKI
QLIITQKWAS LNGWSPFEAW SDYRRLGVPN VPISQDPSTS VKQIPVRLFY PQSEYNFNSG
MVRQQGDINQ FTSKIFWVK
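
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the tool that produced this record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, which is how NCBI defines it (the two tables differ in permitted start codons). The helper name `translate` and the two input fragments, copied from the first and last rows of the gene sequence, are illustrative.

```python
# Compact codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Drop the spaces that separate the 10-base groups in the record.
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence.
first_fragment = "ATGAACCTCA GAAAACTTTC GATCGGGGTA"
# Last 15 bases of the gene sequence, including the stop codon.
last_fragment = "TTCTG GGTAAAATAA"

print(translate(first_fragment))  # MNLRKLSIGV
print(translate(last_fragment))   # FWVK*
```

Both results agree with the start (MNLRKLSIGV) and end (FWVK) of the protein sequence listed above.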