Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1167 |
Symbol | |
ID | 8724900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 1423621 |
End bp | 1426836 |
Gene Length | 3216 bp |
Protein Length | 1071 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003386017 |
Protein GI | 284036087 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAAT TTCTATTAAC ACAGTTTGTC CTGTGTTTAT TCGCACTTCC ATTGATAGCT CAGGATATAG CCATCAGTGG AAAGGTCACA TCGTCGGAGG ATGGTTCAGT GCTTCCCGGT GTTAACATTA CGGTCAAAGG AACGTCTCGC GGAACCAGCA CCAATGCTGA GGGAACGTTT CAGCTTAACG CTCCAGCCAA CTCAAGGCTG GTATTTAGTT TTATTGGCTT CACAACACAG GAAATTGCCA TTGGCAACCG AACAAACATC AGTGTTAATC TTGCCCCCGA CGCGTCTCAG CTTCAGGAGG TTGTCGTTAC GGCGCTGGGT ATCTCCCGCG ACAAGAAGGC ACTGAACTAT GCCGTTCAGG ACCTGAGAGC CGATAAACTA AACTTTGCAC GTGACCAGAA TGTGGGTAAC GCGCTGGCGG GTAAAATTGC CGGTGTGCAG GTACTCGGTC AGTCGGGGGC TAAGTTCGGT AACCCGAACA TCCGCATCCG GGGCGTTAAC TCGCTGTCGG GTGGCGACCC GCTCTACGTT GTAGACGGTA CGCCAACGGA CATTAGCCAG GTGAACATGG ACGATGTAGA GAACCTGACC GTGCTGAAAG GACCCTCTGC AACGGCTCTG TATGGTAACC GTGCTTCGGC GGGCGTTATC GTCATCACAA CTAAGCGTGC CAAAGCCGGC GAAACCCGTC TGGACATCAA CCACAGTACA ACGCTCGACA TGGTGGCTCT GCTGCCTAAG TACCAGAACG AATACGGTGG TGGTTACTCG CAGGAGTGGG AAACGTTCCA GTTCGATCCA TCCATTCACC CGGCGGCATG GTCGTCGTTC AATGGGCAGA AAATTCTGGA CTACTCGGCC GATGAAAGCT GGGGTCCCCG TATGGATGGC TCACCCCACC GGTCGGCGTT TTCCTGGCAG CCAGGTGCTG AATTTGGTCA GCTGACGCCG TTCTCGCCAC AGCCCAACAA CGTACGCGAC TTCTTCGAGA AGCCGATCAG CAACAACACG AATATTGCCT TTTCGCGCGG AACGGAGGCT TTTCAGAGCC GTATCTCCTA CACGCACATC ATCAACAACG GGATTATCCC CAACTCGTCG CAGTCTCGTG ATTACGTCAG TGCGAAGAAT GCCATCAAGT TTGCTGAGAA ATTAACAGCG AACCTGAATT TCAACTACAC ATCGACAAAC ACGAAAAACC AGCCTGCCGA CCGCTATGGC TCATCGGGAG GAACAACGCC ACAGAACAGT CCGCTGGGTA TCTCCAACTC GACCCTGAAC GGCTACAACC AAACGATTGG TATGTTCAAT CAGTGGTTTC AGCGTCAGTT ACGGATTGAG GATTTGCGTA ATTACAAAAA TCCGGATGGT ACCTTCCGTA GCTGGAACAT CGGCGGTCCC TTGGAAGCGG CCCCTAAATA TTGGGATAGC CCATACACGC AGGCCTACGA AAATACCAAC ACCAACCGGA GCGACAGGTT ATTTGGTGAC ATTGGCCTGA CTTACCAGTT CACACCGGCT CTGAAAGCAT CGGCTACGGT ACGCCGTGAC CAGAACGCTT ACTATCAGCA GGGCCGAGTG GCAATTGGTA CACTGAATGA AGGACAGAAA GGTGGCTTCT CGACCTTAAC CTCCAACAGC CGCGAAAACA ACTATGAGTT GCTGGTGAAC TACAACGAGA ACTTCAAAAA CTTGTCTGTT GTCGCCAACG CAGGGGGGAA CATCCGCTAC AACCGTGTTG ATGGTCTTTT CCAGGCCACA GTAGGGGGCT TATCGGCACC AGGTTTCTAT AACATCGCGG CTTCTATTGA CCGGCCATTA TCCAACAACT ACCTCTATGA GCGCCGGATC AACAGTGTGT TCGGAAACGT AAGTGTTGGT TTCCGTGACT TTGTTTTCGT TGAAGCTTCG ATCCGGAATG ACTGGTCGTC TACGCTGCCT AAAGCGAACA ACGCCTATCT GTATCCGTCG GTGTCGGCCG GTGTTATCCT GACCGAATTG CTGCCCAAGA GCCAGGTACT TTCCTATGCT AAAGTTCGGG CGGGCTATGC TCAGGTGGGT ACCGATGTTG GCCCTTACCA AACAGCCCTG GCGTATACCT CTGGTCAGCC TTATGGCAGC AATGCAACCG CCTTTTTGCC CGGCACATTG CCCAACGCCA GTCTGAAGCC CGGCCTATCG TCTTCCTATG AAGGCGGTAT CGACCTGAAA TTCCTGAACA ACCGGATCGG TGTGGAATTT ACTGCCTATC AGAATGACAA CAAGAACCAG ATCATTCCGC TGCCGGTAGC GCCTACGAGC GGGTATACCA ATGCCGTAGT AAACGCCGGG TTGATCCGTA CGTCGGGTCT TGAATTGCAC ATTTACGCTA ACCCAATCCG GTCGGCGTCT GGCTTTAACT GGGAGTTCGA CATCAATGCA GACCGCAACC GTTCGCAGGT GATTGAACTG GCCAATGGCC TGACCAACTA CCAGATCGAC GGCCCACAGT GGCGTACACT GACGCTGAAC GCCCGTACTG GTACCGATGG CTCACCCCGC GACTGGGGTA CGCTTGTGGG ACAGGGGATT CAGAAAGACG CGAATGGTCG GAACATGGTA TATGGCAGCG GTGCAAACGC CGGTCTGTAT ATCAAACAGG ATAACGTTGA GCTAGGCTCG GTACTGCCCA AGTTCAAAGG CGGCTGGTTG AATACCTTCA GCTACAAGAA CGTGACCCTG CGCGTAAACA CTGACTTCGT TGTAGGTGGT AAATTCTTCT CGACCACCAA AATGTTTAAT GCTTACTCGG GTTTGGCAGC TGAAACAGCG GGTCTGAACG AATTGGGTAA GCCACTGCGT GATGATCCGG CTTCGGGCGG AGGTGTTTTG CTGGATGGTG TAACCGAAGA CGGAAAGCAA AACACGACTC GTGTTGATGC ACAGAACCTG TACGAAAACT GGCTGTTCGC CCTGAACGAG CGCTGGATTT ATGACAAAAC GTACGTGAAA CTGCGCGAAG TTTCGTTCGG TTACAACCTG CCGAAGCAAA TGCTGGGCAA GTGGTTGAAG TCGGCAAATA TTTCGTTGAT TGCCCGTAAC CCGGTTCTGA TCTACAGTGC CATTGGCGGT GGTATCGACA TCTCTGAGTC GGAGACGATC TGGTACGAAG GTGGTCAGTT ACCTCCGGTT CGCTCGTTCG GTGTAAATCT TAGATTAGGC CTCTAA
|
Protein sequence | MRKFLLTQFV LCLFALPLIA QDIAISGKVT SSEDGSVLPG VNITVKGTSR GTSTNAEGTF QLNAPANSRL VFSFIGFTTQ EIAIGNRTNI SVNLAPDASQ LQEVVVTALG ISRDKKALNY AVQDLRADKL NFARDQNVGN ALAGKIAGVQ VLGQSGAKFG NPNIRIRGVN SLSGGDPLYV VDGTPTDISQ VNMDDVENLT VLKGPSATAL YGNRASAGVI VITTKRAKAG ETRLDINHST TLDMVALLPK YQNEYGGGYS QEWETFQFDP SIHPAAWSSF NGQKILDYSA DESWGPRMDG SPHRSAFSWQ PGAEFGQLTP FSPQPNNVRD FFEKPISNNT NIAFSRGTEA FQSRISYTHI INNGIIPNSS QSRDYVSAKN AIKFAEKLTA NLNFNYTSTN TKNQPADRYG SSGGTTPQNS PLGISNSTLN GYNQTIGMFN QWFQRQLRIE DLRNYKNPDG TFRSWNIGGP LEAAPKYWDS PYTQAYENTN TNRSDRLFGD IGLTYQFTPA LKASATVRRD QNAYYQQGRV AIGTLNEGQK GGFSTLTSNS RENNYELLVN YNENFKNLSV VANAGGNIRY NRVDGLFQAT VGGLSAPGFY NIAASIDRPL SNNYLYERRI NSVFGNVSVG FRDFVFVEAS IRNDWSSTLP KANNAYLYPS VSAGVILTEL LPKSQVLSYA KVRAGYAQVG TDVGPYQTAL AYTSGQPYGS NATAFLPGTL PNASLKPGLS SSYEGGIDLK FLNNRIGVEF TAYQNDNKNQ IIPLPVAPTS GYTNAVVNAG LIRTSGLELH IYANPIRSAS GFNWEFDINA DRNRSQVIEL ANGLTNYQID GPQWRTLTLN ARTGTDGSPR DWGTLVGQGI QKDANGRNMV YGSGANAGLY IKQDNVELGS VLPKFKGGWL NTFSYKNVTL RVNTDFVVGG KFFSTTKMFN AYSGLAAETA GLNELGKPLR DDPASGGGVL LDGVTEDGKQ NTTRVDAQNL YENWLFALNE RWIYDKTYVK LREVSFGYNL PKQMLGKWLK SANISLIARN PVLIYSAIGG GIDISESETI WYEGGQLPPV RSFGVNLRLG L
|
| |