Gene Information |
Locus tag | Slin_6039 |
Symbol | |
ID | 8729820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 7323261 |
End bp | 7326440 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003390800 |
Protein GI | 284040870 |
COG category | |
COG ID | |
TIGRFAM ID | |
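The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 7323261
END_BP = 7326440
GENE_LENGTH_BP = 3180
PROTEIN_LENGTH_AA = 1059


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```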
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.29047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACACA TTTTAACGAA CGAAAGGAGA CGGTACCTCA CGCTCCTGAT GGCTATCTGG TTCCTCACTA ACCTCTCCAC TCTGGCGCAA AATTCGGGGC GTACCATCAC CGGAAAAATC CTGTCGAAAA CCGATGGAGC GGGGCTTCCC GGTGCCAACG TACTGGTGAA AGGCTCATCG GTTGGGGCGG TGACGGATGC CGCAGGTAGC TTTTCGATCA ATGCCCAGCC CAACGCCACC CTAGCGGTCT CTTACATTGG TTTTGTTTCG CAGGAAATTG CGATTGGCAA CCAGACCGAG GTGGTGATTT CGCTGGCCGA AGATGCGTCC CAACTAAGTG AAGTGGTCGT TACGGCGCTT GGCATTTCGC GGGATAAAAA AGCACTCGGG TATAGCCTTC AGGAGCTGAA GGGAAACGAA CTCACACAGG CCCGGCCAAC CAACCTGGTC AACGCCTTGT CGGGTAAGAT TGCCGGTATT CAGGTGACGG CAACGAACGG ACTACCCGGC GCATCGTCGC GGATTCTGAT TCGCGGAGCC AACTCCATCG GAGGCAATAA CCAGCCGCTG TTTGTGGTCG ACGGGATTCC GATTGATAAC GGCAGTTACA ACGTAACGCC GGGAAGTACG GGCGGAAACG TCAACAACGT AACGACGGAT TACGGCAACG GCGCATCGTC CATCAACCCC GATGACATTG ATAATATTTC AGTCCTGAAA GGTGCCAATG CCGCTGCGCT GTATGGCTCG CGAGCGGCCA ACGGAGTTAT TCTGATCACG ACCAAACGGG GTTCGGCCAG CAAGAACATT GGCGTAACCG TAAATACCAA CACCACGTTC GAAAATCCGC TGCGGCTTCC CGATTTTCAG AATGAATACG GGCAGGGACT TAAAGGGCAG TTTTCGTACG TCGATGGCAT GGGCGGGGGC GTCAATGACG GCGTCGATGA AAGTTGGGGG CCAAAACTGG ACGGGCGGCT GATTCCGCAA TTCAACTCGC CCATTGGTGC CGATGGCAAA CGCACGGCCA CACCCTGGAT TGCCCGGCCC GATAACGTCA AGAACTTCTA CGATACGGGC GTTACTACCA CGAACAGCAT TGCGCTCACC GGCGGTAACG AGAAAGGCGA TTTTCGACTG GGTTATACCA ATCTTTACCA GAAAGGCATG CTGCCCAATA CGAACTACAA ACGGCAGAAT CTTTCGTTCA ATGCAGGCTG GAATTTCACG CCGAAATTCA CCGTCCGGAC CAGCATCAAC TACATAAAAG ACGGTTCCGA CAATCGGCAG AACCTGAACC TCTACTGGAT ATGGTTCGGT CGGCAGGTCG ATCTGGAAGA CCTCAAGGGC AATCCGGTCC AGCCCGATAC CGACCCGAGC CAATGGCCCG TGCAACGCAA CTGGAATTTG AACTACTGGA ATAATCCGGC GTATGCGCTG AAGTACCTGA AATACGCCAA CGATAAAGAT CGCCTGATTG GCAACATTAC CGCCACCTAT AAACTGACCG ACTGGCTGAC ACTGACGGGC CGCACCGGAA CCGATTTTTC GAATGATCGG CGAACAACCA AACAGGCGAA AAACGTAGGC GTTCCCAATG GCAGCTATGC CGAAGACATC GTGTATGTCA GCGAAACCAA CAGCGACTTC CTGCTGACCG CCGACAAACG GGTCAACGAA TTTCATATCG TAGCCTCGGT TGGCGGCAAC ACCCGCCGGA ACTACACCCA GCGCGATTAC ATGTACGCGT CGGAGTTGAC GATTCCCAAC CTATACAACA TTGGTAATGC CAAATCGCGC 
CCAACGGTGT ATAACCGTAT TACGGACAAG CGGGTAAACA GCCTCTACGG CTCGGCATCG CTGTCGTTCC GGGACTATCT GTTTGTGGAC CTCACGGCTC GCAATGACTG GTCGAGTACG CTGCCCGCCG GTAATCGCAG CTATTTTTAC CCGTCTGTAT CAGCCAGCGC CATTATAACA GACATGCTGG GGCTAACTTC CAACGTACTG ACCTACGCCA AGCTGCGGGG TGGCTTTGCT CAGGTGGGTA ATGACACCGA TCCGTACAAC CTGACGCAGG TGTATTCGAG CGAAACGGCT TGGGGCAACA CGACGACTTT TTCGGAGAAT AACCTGATTT ATAACAAGAA CCTGAAGCCG GAGCTAACCA CGGCCATTGA GTTTGGCGTT GAAACCCGAC TATTCCGAAA CGCGCTCAAC TTCGAGTTCA CGTATTACGA CAAGAACACG AAAAACCAGA TTTTACAGGC CAACGTGGCG CAAAGTTCGG GCTATTATAA CTCGGTTATC AACGCCGGTC AGATTCGGAA CAGCGGTTTC GAGATCGAAC TGTCTGGAGC ACCAATCAAA AATGCGGGTG GATTCCGATG GGATGTGGGT ATCAACTTCG CCCGGAACCG CTCCGAAGTG GTGGATTTGG GCGGACTGTC TACTTATCAG ATCAATACCG GTTCGCTGCT GCGCAACGTG ATTCTGGAAG CTCGGCCGGG CGATCCGTAC GGCAATTTCT ACGGTACGTA TTACCGGCGC GATCCGAGTG GTAACCTCAT TTTCAACAGC CAGGGGTACC CCATCATGGC ATCGGACCGA AAAGTGGTCG GGAACATCAT GCCGAAATGG ACGGGCGGTT TCCAGAATAC ATTCAGCTAT AAGTGGGTAT CGCTCAGTTC GCTGATCGAT GTGCGCTACG GTGGTAACGT CTTCTCGCAG GGTATCAACA TTGGTCGGTA TACGGGTGTG CTGGCCGAAA CGCTGCCCGG TCGCGAAGGC AATATTGTGG GGCAGGGCGT TGTGGAGAAG GCCAATGCCG ACGGTAGCTT CTCGTATTCG CCTAACACCA CGGCGGTAGC ATCGGCAGAT GATTACTACC ACAATTTTTA CAACCGCAAC GTCAACGAGA ATTACATTTT CGATGCGAGC TATGTAAAAC TACGGGAAGT GCGGCTGGGC TTCGCTATTC CGCAGCGGTG GCTGGGCAAA ACGCCCTTCC GCAGTGCAAC GTTTGCGCTG GTGGGCCGGA ATCTGGCGCT TCTCTACAAA AATATACCGC ATATCGATCC CGAAACCAGC TACTACGGCG ATGGTAACGT GCAGGGCTTC GAAAACGGTA ATACGCCATC GGCCCGCAGC ATGGGCTTTA ACCTCAACTT CGGACTTTAA
|
Protein sequence | MKHILTNERR RYLTLLMAIW FLTNLSTLAQ NSGRTITGKI LSKTDGAGLP GANVLVKGSS VGAVTDAAGS FSINAQPNAT LAVSYIGFVS QEIAIGNQTE VVISLAEDAS QLSEVVVTAL GISRDKKALG YSLQELKGNE LTQARPTNLV NALSGKIAGI QVTATNGLPG ASSRILIRGA NSIGGNNQPL FVVDGIPIDN GSYNVTPGST GGNVNNVTTD YGNGASSINP DDIDNISVLK GANAAALYGS RAANGVILIT TKRGSASKNI GVTVNTNTTF ENPLRLPDFQ NEYGQGLKGQ FSYVDGMGGG VNDGVDESWG PKLDGRLIPQ FNSPIGADGK RTATPWIARP DNVKNFYDTG VTTTNSIALT GGNEKGDFRL GYTNLYQKGM LPNTNYKRQN LSFNAGWNFT PKFTVRTSIN YIKDGSDNRQ NLNLYWIWFG RQVDLEDLKG NPVQPDTDPS QWPVQRNWNL NYWNNPAYAL KYLKYANDKD RLIGNITATY KLTDWLTLTG RTGTDFSNDR RTTKQAKNVG VPNGSYAEDI VYVSETNSDF LLTADKRVNE FHIVASVGGN TRRNYTQRDY MYASELTIPN LYNIGNAKSR PTVYNRITDK RVNSLYGSAS LSFRDYLFVD LTARNDWSST LPAGNRSYFY PSVSASAIIT DMLGLTSNVL TYAKLRGGFA QVGNDTDPYN LTQVYSSETA WGNTTTFSEN NLIYNKNLKP ELTTAIEFGV ETRLFRNALN FEFTYYDKNT KNQILQANVA QSSGYYNSVI NAGQIRNSGF EIELSGAPIK NAGGFRWDVG INFARNRSEV VDLGGLSTYQ INTGSLLRNV ILEARPGDPY GNFYGTYYRR DPSGNLIFNS QGYPIMASDR KVVGNIMPKW TGGFQNTFSY KWVSLSSLID VRYGGNVFSQ GINIGRYTGV LAETLPGREG NIVGQGVVEK ANADGSFSYS PNTTAVASAD DYYHNFYNRN VNENYIFDAS YVKLREVRLG FAIPQRWLGK TPFRSATFAL VGRNLALLYK NIPHIDPETS YYGDGNVQGF ENGNTPSARS MGFNLNFGL
|
| |
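The sequences above are printed in blocks of ten characters separated by spaces. The following is a minimal sketch of how such a block-formatted nucleotide string could be cleaned up and its GC percentage computed, for comparison with the GC content field of the record. The helper names are hypothetical, and the demo input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence, used here only as a short demo input.
    first_blocks = "ATGAAACACA TTTTAACGAA CGAAAGGAGA"
    seq = clean_sequence(first_blocks)
    print(len(seq), "bases,", round(gc_percent(seq), 1), "% GC")
```

Applied to the full gene sequence, the cleaned length should equal the listed gene length, and the rounded GC percentage can be compared with the listed GC content.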