Gene Slin_4723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4723 
Symbol 
ID8728487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5752104 
End bp5755343 
Gene Length3240 bp 
Protein Length1079 aa 
Translation table11 
GC content57% 
IMG OID 
ProductNHL repeat containing protein 
Protein accessionYP_003389500 
Protein GI284039570 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.645184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCTAC TGAAACCAAT TCAAATGCTA AAATTTTACC CGCCAACCGG CCTCGATAGG 
TCGGCGACTA ACTCGTTTCG TCCTGACGCC AGTTGGGTCA GCCGACTAAC CTGCTTAGTC
GTATTTCTTT TACTGAGCCT GAGCAGCCCA TTGCTAGCGC AATATAATTC CAACGCCGTA
ACAGTTGCCG GTACCGGTAC CGCTGGTTCA GCCGCTAATC AGTTAGATAG TCCCTTCGGT
ATTTATGTTG ACGAAGCCGG AAGCATGTAT GTAGCTGATT ACAACAACCA CCGCGTCCAG
AAATGGGCCT CGGGGGCTAC CTCCGGAACT ACTGTAGCCG GTACCGGTAC CGCTGGTTCA
GCCGCTAATC AGTTAAATCA TCCTCTTGGT GTGTATGTAG ACGGAGCGGG AGCCATCTAT
GTGTCTGACA CCGATAACAA TCGCGTCCAG AAATGGGCCT CGGGAGCTAC CTCCGGAACT
ACAGTGGCCG GTACCGGTAC CGCCGGTTCA GCTGCGAATC AGTTGAATTA TCCTATAGGT
ATCTATGTAG ACGGAGCGGG AGCCACCTAT GTGGCTGATG CTTCTAACTC ACGCATTCAG
AAATGGGCTG CGGGAGCCAC ATCCGGTACT ACCGTCGCCG GGGGGAATGG ACAAGGCTCA
GCTGCCAATC AGCTTTGGAG TGCAGCGGGT GTGTATATAG ACGGAGCAGG AGCCATTTAT
GTGGCTGACG GCGGAAACAA CCGCATCCAG AAATGGGCCT CAGGGGCCAC ATCCGGTACC
ACCGTGGCCG GTACCGGAGT ATATGGACCC GCCAGCAATC AACTATCATA CCCATTTGCT
GTGTATGTTG ATGGAGCGGG CACGATGTAT GTGAGCGACC AGCAATCTCA CCGCATTCAG
AAATGGACTG CGGGGGCGAC CTCGGGTACC ACCGTAGCCG GGGGGTACGG CAATGATGCT
CTGGTGCCTT ATCAGTTAAA TTACCCAAGA GGTATTTATC TTGACAGGGC CGGAGCCATC
TATGTAGCCG ACCAACGCAA CAACCGCATC CAGAAGTTTA GTCGCATCAA TCCACTCGTC
AACCCTTCCC TGACCATCGC CACAACGAGT CAGACCAGTT GTACAGGCGC ATCGGTGAGC
TTCACCGCCA CTTCCACCAA TGGCGGCACC AGCCCCGCTT ACCAGTGGAA AAAGAACGGC
TCCAATGTGG GCACCAGTGA CGCTACCTAT ACCGATGCGG CCCTGACTAG TGGCGATGTC
ATTAACTGCG TGCTCACCAG CAATGATCAA TGGGCCTCCC CTACCACGGC CACCAGCAAT
AGCCTGACGA TGACGGTCAA CCCGCTACTT ACCCCGGCCC TCACCCTTGC CATTACCACC
GGCAGTCAGA CCAGCTATGC GGGCACCGCC ATCACGTTTA CCGCTACGCC TATCAATGGC
GGTACCAACC CCAGATACCA GTGGAGAAAG AACGGCACCA ACGTGGGCAC CAACAGTGCC
ACCTACACTG ATTTCGCCCT GGCCAACAAT GACCTGATCA GCTGTATACT TGTCAGCAAC
GTCACCTGCT ATACCACGCC CACCGCTGAC AGCAATAGCC TGAAGATGAC CGTCATCGCT
CTGGTAACGC CTACCCTCAC CATCGCCACC GGTAGCCAGA CCAATTGCGC AGGCAAATCA
GTCAGTTTCA CCGCTACGCC TACCAATGGC GGTAGCAGCC CCGCCTACCA GTGGAAAAAG
AACGGCTCCA ACGTGGGCAC CAACAGTGCC ACCTATACCG ATGCCGCCCT GGCCAACAAT
GACGTGATCA GTTGTGTGCT CACCAGCAAT GCCCCCGGAA CTACGACCAG CACGGCCACC
AGCAACAGCC TGACGATGAC GGTCAACCCG CTACTTACCC CGGCCCTCAC CCTTGCCATT
ACCACCGGCA GTCAGACCAG CTATGCGGGC ACCGCCATCA CGTTCACCGC TATGCCTACC
AATGGCGGTA CCAACCCGGC CTACCAGTGG AAAAAGAACG GCTCCAACGT GGGCACCAAC
ACGGCGACCT ACACTGATTT CGCCCTGGCC AACAATGATG TTATCAGTTG TGTGCTCACC
AGCAACGTCA CCTGCCCCAC CACGCCCACC GCTACCAGCA ATGACCTGAC CATGACCGTC
ATCGCTCTGG TAACGCCCAC CCTCACCATC GCCACCACCA GTAGCCAGAC CAGTTGTGCA
GGTACATCAG TCAGTTTCAC CGCTACGCCT ACCAATGGCG GTAGCAGCCC CGCCTACCAG
TGGAAAAAGA ACGGCTCCAA CGTGGGCACT AACACGGCGA CCTACACCGA TGCCGCCCTG
GCCAACAGTG ACGTGATCAG CTGTGTGCTC ACCAGCAATG CCCCCGGAAC TACGACCAGC
ACGGCCACCA GCAATAGCCT GACGATGACG GTCAACGCCC GGCCCGATGC GCCCGCCCTG
ACCCCCGCCA GCTCTAGTCT GGCAGCGACT CTGACACCCC TCTCGCTGAC GAGCTTTGCA
CTGGCAACCA CCGGCAATAG CCTCCACTTC TTCCAAGCCG GAGGTAGTGA ACTCAGCCCC
CCTACCGTCA GTATCGCCAC TGCCGGGGTT ATGAGCTTTT CGGTCGGCCA GACCAACAAC
GCCAGCGGCT GCAAGAGTTT ACTCACGCCA TTGAGTCTGA CCATCACGGC CACCCCCACC
AGCCAGACCG TTTGCCGCAG TAGCAACGCC ACTCTGAACG TCACTCTGGT GGGAACCGCC
TTCCAGTGGT ACAAAAACGG TACTACCACA GCCAACAAAC TCACCGAGCT GACCAGTGCC
CAGCGCGGTA CGACCACCGC CACCCTGACA CTGGTCAATT TGCAAACCAC CGCCGACTAC
TACTGCAAAA TCACTACTTC CACCGGCGTT CAGACCGTGG GGCCCCTGAA GGTGAGTGTC
AACTTTGGCT GTTCGGCCCG GCCTGCGGCC GAGGAAGCAG ACTTGCAACT ATTGGTACTG
GTCAGGCCAA ACCCTATCGT AGACGGCCAC CTGCGGGCCC TGGTGAAGGG GGCTCAGGGG
CAAGCCCTGA ACGTAGCCCT CTACAGTCTG CAAGGGGAGT TGGTGAACCA GCAGGTCTGG
CCCTCGGCAC CCGCCGAAGT CAATCTGGAT TGGGACATCA GCCAGCGAAC CACGGGAGTG
TTACTCTTGC GGGCCCAGAC CCCAACTCAA CAGCAAACCA TCAAGATTAT CCAGAATTAA
 
Protein sequence
MYLLKPIQML KFYPPTGLDR SATNSFRPDA SWVSRLTCLV VFLLLSLSSP LLAQYNSNAV 
TVAGTGTAGS AANQLDSPFG IYVDEAGSMY VADYNNHRVQ KWASGATSGT TVAGTGTAGS
AANQLNHPLG VYVDGAGAIY VSDTDNNRVQ KWASGATSGT TVAGTGTAGS AANQLNYPIG
IYVDGAGATY VADASNSRIQ KWAAGATSGT TVAGGNGQGS AANQLWSAAG VYIDGAGAIY
VADGGNNRIQ KWASGATSGT TVAGTGVYGP ASNQLSYPFA VYVDGAGTMY VSDQQSHRIQ
KWTAGATSGT TVAGGYGNDA LVPYQLNYPR GIYLDRAGAI YVADQRNNRI QKFSRINPLV
NPSLTIATTS QTSCTGASVS FTATSTNGGT SPAYQWKKNG SNVGTSDATY TDAALTSGDV
INCVLTSNDQ WASPTTATSN SLTMTVNPLL TPALTLAITT GSQTSYAGTA ITFTATPING
GTNPRYQWRK NGTNVGTNSA TYTDFALANN DLISCILVSN VTCYTTPTAD SNSLKMTVIA
LVTPTLTIAT GSQTNCAGKS VSFTATPTNG GSSPAYQWKK NGSNVGTNSA TYTDAALANN
DVISCVLTSN APGTTTSTAT SNSLTMTVNP LLTPALTLAI TTGSQTSYAG TAITFTAMPT
NGGTNPAYQW KKNGSNVGTN TATYTDFALA NNDVISCVLT SNVTCPTTPT ATSNDLTMTV
IALVTPTLTI ATTSSQTSCA GTSVSFTATP TNGGSSPAYQ WKKNGSNVGT NTATYTDAAL
ANSDVISCVL TSNAPGTTTS TATSNSLTMT VNARPDAPAL TPASSSLAAT LTPLSLTSFA
LATTGNSLHF FQAGGSELSP PTVSIATAGV MSFSVGQTNN ASGCKSLLTP LSLTITATPT
SQTVCRSSNA TLNVTLVGTA FQWYKNGTTT ANKLTELTSA QRGTTTATLT LVNLQTTADY
YCKITTSTGV QTVGPLKVSV NFGCSARPAA EEADLQLLVL VRPNPIVDGH LRALVKGAQG
QALNVALYSL QGELVNQQVW PSAPAEVNLD WDISQRTTGV LLLRAQTPTQ QQTIKIIQN