Gene Slin_4031 details

Gene Information

Locus tag: Slin_4031
Symbol:
ID: 8727789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4843540
End bp: 4846902
Gene Length: 3363 bp
Protein Length: 1120 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003388820
Protein GI: 284038890
COG category:
COG ID:
TIGRFAM ID:
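
The Start bp, End bp, Gene Length and Protein Length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4843540
end_bp = 4846902
protein_length_aa = 1120

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
expected_protein_length = codons - 1

print(gene_length_bp)           # 3363
print(remainder)                # 0
print(expected_protein_length)  # 1120
```

Under these assumptions the computed values agree with the listed Gene Length (3363 bp) and Protein Length (1120 aa).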


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.280222
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGACC TGTTCAGCCA CGCCCATGAA TCCAGCGCAC CTTCACTTCA CCTGAGTGAT 
GTGCTTCAGG AAGAGTACAA CTTCATCCAC GACATTATCC CTGCTCCTCC CGAAATACCG
TTGACCGATC ATGATCAAGA CCAGCCCGAT TCGGAGGATG TTCACCGGTT CAGGGTCAGT
GAACCGGTCG AATTGTTCAG GAAGCTGGAG GCCGAGCGAG AACACATTGA GGCTACTGCG
AAAACAGCCT TACCGAAACA GACACATACT GAAACATCCT ATACCCGTCT GGCCGAGTGG
CTTGCCGAAA AACCCCTGCG TGAAGACCTC GTCAGAGATA CCCTTATAGC CAAAGCCGCG
AACCCCGATT TATTGACTGA AGAACCCTGG TTACTGAACG CAGCTTTACG GCCTTACACT
CGGGGGTTGC TTAAGCGTTA TCTACAATGG AAAGATGGAC GACACAAGGA CACCACCCAT
TTCCCAACCG ACGAATTGAA CCAGCTTTTG CTGGAAGATG CCTTCGCGGA GCACATCGTG
CCCAGAGAAA AATCCGCATT AGACACGATT TTCGGTAAAA TGCTCGGTAC GTACCAGAGT
GCCTTATGCC TGTCGGGCGG GGGTATCCGA AGCGCCACCT TTGCGCTGGG TGTCATGCAG
GGCCTCGCTC AGCACAACCT GCTTGGGCGA TTCTCCTACC TCTCTACCGT ATCGGGCGGT
GGGTATGTAG GCGGCTGGCT TAGTGCCTGG CGGCACCACG AAGGGCTGGA CAAAGTCATT
ACCAAACTTA AGACTACCAG CAGTACGCCC ATTGCCAGTG AAGCTGACCC GATCCGGCAT
CTGCGCCAAT TCAGCAACTA CCTGAGCCCG CAGTTGGGCC TATTCTCCGC CGACACCTGG
ACACTCGTTG CCACCTATAC ACGTAACCTG CTGTTAATCT GGCTGGTGAT TCTGCCCTTT
CTGGCGGCTC TCGCGGCCGT TCCCTGGGTA GGCGTTACCC TGGCCTCTGC CAAACTGAAT
CCTGGCGATA CCATTTGGTT CTGGATTATG GGGAGTTTAC TGGCGGTAGC GGGTGCGTTG
TCCGTGATGG CTGTCTATTT TGTCCATTCG TACATACCGA CACCCGAAAC AAAAAAACCG
TCCGAAAAAA AACTCACCGA CATTCCCCTG AAATCAGACC GGGATCAAAC CGCGTTCATT
AACAAGTGCC TGCTGCCGTT CTCCGTCGCT GTTCTTTTGT TGATACTGGT GTGGATCTGG
TTCACTAAAC TGGACTCAAC CAGTACACAT TGGGCGGGCA ATATATATAG ATCGCTGGGA
CTCAACCAGC GAACCGGACT GAACATTTTC TCGGAGGGAG GCTACTGGAT TATGGGCGGT
ACCACACTGG CCCACGTCAT AGGCTGGCTA CTGGCCCGGC CAACACCGAA GAAGTTTTAC
TTACAGTTCC TGATGTTTCT GGTGATCGCC GTTGTAGGAG CTATGGCAGG TTTTTTGTTA
TTGCTGACCG CCAAACTGTT GCGTTCATCA TCCATAGAAT TGTACACCTG TCTGGCTTTT
CCCTGTTTTA TGCTGTCCAT TTTGCTGGTT GGTTATTTCT TTGAAGGCGT TGTCAGCCGA
TACCTCGACG ATGCCCGACG CGAATGGACG GCCCGGTACA GCGCCTGGCT GCTGATTGCG
GCCCTGGGCT GGCTCGTTTT ATCGAGTGTC ATCCTGTTTG GACCGGGGCT TATTGACGCC
ATAAAGCTGC AAGTCGCCAG CATTGGCCTT GGTTCGGGTA TTTTAACGGC TCTGCTTGGA
GGAAGTGCGC AGAGTGCCGG GCGGGGCGAG GGTGCCGGAT CGCGTCAGGG AAAAGGCAAC
GCGTCGGGCA TCATCGGGTT ACTATCCCAA TTCAGCCTGC CCATTGTCGC TACGCTGACC
ATCTTTATCC TGATGGTCAT GCTTTCGCTG CTGAACCAAA CGCTGGCCGG GCTGCTGACG
GATCAGCTTT ATGACTGGTT CGGCGACAAT ACATCTGGCC GTATCCTAAC CGTTTTTACC
CCTCTGATTC TGCTCATCCT CTTTCTGGTG GCAGGCTGGC TGCTGGCTCT GATGATCGAT
ACCAATCGGT TTTCGCTCCA TGCGATGTAT AGAGCCCGGC TGATCAGGGC GTATCTGGGG
GCCTCGCGTC CGCAGGAAAC ACGCACCCCC GACCCGTTCA CTGGCTTTGA TGAAGACGAT
AATATTCCCA TGGGCCAATT GAAGGTGGAT TCCTATACGA CACCGACTAC AAATACGCCG
AACGAGGTTG GCCCCGAAAC TAAACCAGAA GCAACGCCAA AGAAGCCCCT GTTTCATATT
ATAAATCTGG CTCTCAACCT GGTAAACGGG CAGAATCTGG CCTGGCAGGA GCGAAAAGCG
GAAGCGTTTT CCATTTCACC CCTGCACGCC GGAGCCATGA ATCTGGCGTA TCGGCGAACC
CGCGTCAAAA TCAACCCTAC TGATTACCGC TCCGGGCAAG AAAACCCGGC TTTGTCGACA
CCGGAGTATA ACTGTTATGG CGGTAAAAAA GGTATCAGTT TAGGTACAGC CATAACCATA
TCGGGAGCGG CTGCCAGCCC GAATATGGGT TATCACTCAT CTACTCTGGT TGCTTTTCTG
ATGACCTTAT TCAACGTCCG GCTTGGCTGG TGGCTGGGAA ACCCCGGTCC GGCGGGCGAC
AAGACGTTTG ATAAGTCGAC GCCCGACCTG GCCGTTAAAC CCATCTGGGA TGAACTTCAG
GCCAATACCG ACGATACTAA CGAATATGTG TACCTGTCGG ATGGCGGGCA CTTCGAGAAT
CTGGGCCTTT ATGAAATGGT GTTACGGCGC AACCGGTTTA TTGTCGTGAG CGACGCCAGC
TGCGACGAAT CCTGTACGCT GGAAGACCTG GGCAATGCAA TCCGTAAAAT CCGTATTGAC
CTGGGTATAC CCATCGAATT TCAGGGCAAC TTCCCCATTC AGGCCCGGTC AACCAATGGG
GTCAATGCAG AAGGAAAATA CTGGGCACTG GCCCGCATTG GCTATTCCGC CGTTGATAAG
CCGACTGCTG CAACAGACCC CGACGAGGTG GATGGTCTGC TGCTTTACAT TAAACCTGCT
TTCTACGGCA ACGAACCCCG CGACATATTC AATTATGGCT CTACCAAAAG TGCTTTCCCC
CACGAATCGA CGTCGGATCA GTTCTTTTCA GAAAGTCAGT TTGAAAGCTA CCGGGCGCTG
GGCAGACATG CTTTCGAGAC CATGCATACC AGCTTCAAAA AAGAAGCCGG TGTAGAATTA
AATGAACTGT TTACGAAAAA TGGACTTGCC CTCCACTGGA AGTTCATGAA GACCAAAAGC
TAG
 
Protein sequence
MPDLFSHAHE SSAPSLHLSD VLQEEYNFIH DIIPAPPEIP LTDHDQDQPD SEDVHRFRVS 
EPVELFRKLE AEREHIEATA KTALPKQTHT ETSYTRLAEW LAEKPLREDL VRDTLIAKAA
NPDLLTEEPW LLNAALRPYT RGLLKRYLQW KDGRHKDTTH FPTDELNQLL LEDAFAEHIV
PREKSALDTI FGKMLGTYQS ALCLSGGGIR SATFALGVMQ GLAQHNLLGR FSYLSTVSGG
GYVGGWLSAW RHHEGLDKVI TKLKTTSSTP IASEADPIRH LRQFSNYLSP QLGLFSADTW
TLVATYTRNL LLIWLVILPF LAALAAVPWV GVTLASAKLN PGDTIWFWIM GSLLAVAGAL
SVMAVYFVHS YIPTPETKKP SEKKLTDIPL KSDRDQTAFI NKCLLPFSVA VLLLILVWIW
FTKLDSTSTH WAGNIYRSLG LNQRTGLNIF SEGGYWIMGG TTLAHVIGWL LARPTPKKFY
LQFLMFLVIA VVGAMAGFLL LLTAKLLRSS SIELYTCLAF PCFMLSILLV GYFFEGVVSR
YLDDARREWT ARYSAWLLIA ALGWLVLSSV ILFGPGLIDA IKLQVASIGL GSGILTALLG
GSAQSAGRGE GAGSRQGKGN ASGIIGLLSQ FSLPIVATLT IFILMVMLSL LNQTLAGLLT
DQLYDWFGDN TSGRILTVFT PLILLILFLV AGWLLALMID TNRFSLHAMY RARLIRAYLG
ASRPQETRTP DPFTGFDEDD NIPMGQLKVD SYTTPTTNTP NEVGPETKPE ATPKKPLFHI
INLALNLVNG QNLAWQERKA EAFSISPLHA GAMNLAYRRT RVKINPTDYR SGQENPALST
PEYNCYGGKK GISLGTAITI SGAAASPNMG YHSSTLVAFL MTLFNVRLGW WLGNPGPAGD
KTFDKSTPDL AVKPIWDELQ ANTDDTNEYV YLSDGGHFEN LGLYEMVLRR NRFIVVSDAS
CDESCTLEDL GNAIRKIRID LGIPIEFQGN FPIQARSTNG VNAEGKYWAL ARIGYSAVDK
PTAATDPDEV DGLLLYIKPA FYGNEPRDIF NYGSTKSAFP HESTSDQFFS ESQFESYRAL
GRHAFETMHT SFKKEAGVEL NELFTKNGLA LHWKFMKTKS