Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4760 |
Symbol | |
ID | 8728524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 5795219 |
End bp | 5798521 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | TonB-dependent receptor plug |
Protein accession | YP_003389537 |
Protein GI | 284039607 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAAA GCTTTTACTA CAAAACAATG CCATTGTATA CCCAAAAGAC GCTACTCTTA TCGGTAGCTC TGCTGGCTAT GCTCTGTAGC CTGTCCTATG CCCATGGTTT ACGCAGGCCG GTTACTGTGA TCGACCAAAC CATCACCGGT ACGGTTAGCG ACGATAAAGG TGAAGTACTT CCCGGCGTCA GTGTGGTTGT AAAAGGCACA CAACGGGGTA CCACGACCGA TGTCCAGGGA CAGTATAAAC TCAACGTTCC GGACGGAAAA GCCACGCTGA TCTTCTCATT TGTCGGCTAC CTGCCGCAGG AAGTTCAGGT GGGAAACCAA AGTATCATCA GCGTTACCCT TAAAACCGAC TCCAAGTCGC TGGAAGAGGT GGTCGTGGTA GGCTATGGCA CGCAGAAGAA GGTTAACCTG ACCGGGGCCG TAGATCAGGT TACGAGCGAA GTACTCGAAA ACCGCTCCCT TCCCAACCTC AGTCAGGGTT TACAAGGCAC TATTCCAAAC CTGAACCTGG TTATGGGCGA TGGCAAACCG ACACAATCGC CGACCTACAA TATTCGCGGA ACAACCTCCA TTGGTCAGGG TGGTAATGCG CTGGTGCTGA TCGACGGCGT GGAAGGCGAC CCCAGCCGAC TGAATCCCAA CGATGTAGCC ACGGTATCGG TGCTGAAAGA TGCCGCTTCG GCCGCTATCT ATGGCGCGCG GGGTGCCTTT GGCGTCGTGC TGATTACCAC CAAAAGCCCG ACCAAAGACC GAACGAGCAT TACGTATTCG GTCAATCATT CCATCAAAAG CCCGACCACC GTTCCGAAGT ACGTAACGAA TGGCTATCAA TTCGCGAAGA TGTTCAACGA GGGCTGGTCG GCCTGGAACG ATTATTCGCA GACGCCCCAG AACGTCAACA AAACGGTGCG CTTTTCGCCC GCTTACCTGA CCGAGCTGGA GCGCCGTAAC AATGACCCGA CCCTGCCAAA AACCGTAGTC GACCCCACCA CGGGCGAGTA TGTGTATTAC GAAAACATGG ATTGGTACGG GGAGCTTTAC AAGAAAAACA CCAGCGCCAC CGAGCATAAC CTGTCATTTT CGGGCAGCAG CGGCAAAGCC GACTTCTACG TGACGGGCCG CTACTACACC CAAGACGGCA TTTTCAAGTA CAATTCCGAC GATTACAAGA TTCTGAGTTT GCGCGCCAAA GGCTCTATCC AATTGTATCC GTGGCTGAAG ATTGGCAACA ACGCCGATTT CTCGTCCATG AAGTACCATA ACCCGCTCAA CGTGGGCGAA GGCGGCAGCA TCTGGCGTAA CATCTCGGAC GAAGGCCACA CGGTTGCCCC GATGTTCAAC CCCGATGGTA CCCTCACTTA CTCGGCTGCT TATACCGTTG GTGATTTCTG GTATGGCAAA AACGGCATCG ACATGGACCG GCGCGTATTT CGGAATACAG CCGATTTCTC GACGAAGTTC TTCGATGACA AGCTGCGTGT GAACGGTAAC TTTACTTTTC AGACAACCGA CAACAACGAG TTCCGGACCC GCGTACCAGT TCCCTATAGT CGTAAACCGG GGGTTATCGA GTATGTCGGC ACGAACTTTA ACGATCTGCA AAACCTCTAC CGCGAAACGC AGTACATGGC GACCAACCTC TACGCTGAGT ACGAGCCGCG CTTCAGCCCG AATCATTACG TAAAAGCGCT GGTGGGCTAC AACTACGAGC AGTCGAACTT CAAACGGCTC GAATTGGTCC GAAACGGCCT TATCTATCCC GACGCCAAAG ACATCAACCT CGCACTGGGT CAGTCAATTA CAACCAGTGG CGGCTCGGAG AAGTGGGCTA TCCTGGGCGG TTTCTACCGA TTGAACTACG CTTTTAAAGA CCGGTATCTG GTCGAACTGA ATGGCCGCTA TGATGGCTCG TCCAAATTTC CGACCAACCA GCGATACGCC TTTTTCCCAT CTGTTTCGGG TGGCTGGCGC GTGTCGAACG AATCGTTCTG GAAAGTATCG CCCAAAGCCA TTACGGATCT GAAAATCCGG GCTTCGTACG GTTCGCTGGG CAACGGTAGC ATTGGCTCAT ACGCGTTTCA GGAGCAGTTC AACATTTCGC AGTCGGCACG GGTGCTCAAT GGCGTGAAGC CCCAAAAAAC GGGTCAGCCC ACCGTTATTC CTGACGGTCT GACGTGGGAA ACTTCCACCA CCTCCGATCT GGGTATCGAC TTGGGGATGC TTAACAACCG CCTGACTTTT ACGGGCGATG CGTACATCCG CAAAACAACG GGCATGTTCA CGACGGGCAT GACCTTACCG GCTGTTTTCG GTACCGATGT ACCCAAAGGC AACTATGCCG ACCTGACCAC CAAAGGCTGG GAAGCCGTGC TGACCTGGCG GGATAAACTG AAAGTTGCCA GCAAGCCCTT CAATTACGAA GTTCGGCTGA CGATGTCCGA CTACCAGGCC ACCATCGACA AGTTCAACAA CCCGAATCAG CGTCTGACAG ATTATTACGC GGGCCAGAAA GTGGGTGAAA TCTGGGGATT CGAAACGGCC GGATTCTTTA CCTCGGCTGA TGACATCGCT AAATCACCCA AACAAACCCT GTATAAAGCC TCCAACACGG GCCAGTTGCT GCCGGGCGAT ATTAAGTTCC GCGACATCAA CGGCGATGGA GTAATCAACA ACGGCGACAA TACTGTAGGC AACCCCGGCG ACCGGCGCAT TATCGGCAAC TCGACACCCC GGTACACCTA TGGCGTGATG CTCAATGCCG ACTGGAACAA CTTCTTCTTT TCGACTTTCT TCCAGGGCGT TGGCCAGCAG GATTGGTGGC CGGGTTCGGA AGCCGGGATT TTCTGGGGAC AGTATAACCG GCCTTACAAC AAGCTGCCTG AATGGCAACT AGGCAACATC TGGTCGGAAC AAAACCCGGA TGCCTATTTA CCACGCTACC GGGGTTACGT AGCCCAGAAC GGCTCAGGTG AACTGGCTCA GGCCCAGACC AGATACTTGC AAAATGCAGC TTATGTACGC ATGAAAAATA TCCAGTTTGG CTACAACCTG CCCCGAACGC TGATTCAGAA AGTGGGCATG AGCAGTGCGC GGGTGTTTGT ATCGGGCGAA AATCTCTTCT CCTGGTCACC GTTATACAAA ATCACCCGGG ATTTAGACAT TGAAAATATT GGCCGTTCGG ATGCGGTTTT AAACCCGCCG ACCAACAGCG ACCCCAACAG TAATAACAGT GGCAACGGCA ACAACTACCC GATCCTGAAA AGCTTCACGA TGGGTTTATC GGCCACGTTC TAA
|
Protein sequence | MMKSFYYKTM PLYTQKTLLL SVALLAMLCS LSYAHGLRRP VTVIDQTITG TVSDDKGEVL PGVSVVVKGT QRGTTTDVQG QYKLNVPDGK ATLIFSFVGY LPQEVQVGNQ SIISVTLKTD SKSLEEVVVV GYGTQKKVNL TGAVDQVTSE VLENRSLPNL SQGLQGTIPN LNLVMGDGKP TQSPTYNIRG TTSIGQGGNA LVLIDGVEGD PSRLNPNDVA TVSVLKDAAS AAIYGARGAF GVVLITTKSP TKDRTSITYS VNHSIKSPTT VPKYVTNGYQ FAKMFNEGWS AWNDYSQTPQ NVNKTVRFSP AYLTELERRN NDPTLPKTVV DPTTGEYVYY ENMDWYGELY KKNTSATEHN LSFSGSSGKA DFYVTGRYYT QDGIFKYNSD DYKILSLRAK GSIQLYPWLK IGNNADFSSM KYHNPLNVGE GGSIWRNISD EGHTVAPMFN PDGTLTYSAA YTVGDFWYGK NGIDMDRRVF RNTADFSTKF FDDKLRVNGN FTFQTTDNNE FRTRVPVPYS RKPGVIEYVG TNFNDLQNLY RETQYMATNL YAEYEPRFSP NHYVKALVGY NYEQSNFKRL ELVRNGLIYP DAKDINLALG QSITTSGGSE KWAILGGFYR LNYAFKDRYL VELNGRYDGS SKFPTNQRYA FFPSVSGGWR VSNESFWKVS PKAITDLKIR ASYGSLGNGS IGSYAFQEQF NISQSARVLN GVKPQKTGQP TVIPDGLTWE TSTTSDLGID LGMLNNRLTF TGDAYIRKTT GMFTTGMTLP AVFGTDVPKG NYADLTTKGW EAVLTWRDKL KVASKPFNYE VRLTMSDYQA TIDKFNNPNQ RLTDYYAGQK VGEIWGFETA GFFTSADDIA KSPKQTLYKA SNTGQLLPGD IKFRDINGDG VINNGDNTVG NPGDRRIIGN STPRYTYGVM LNADWNNFFF STFFQGVGQQ DWWPGSEAGI FWGQYNRPYN KLPEWQLGNI WSEQNPDAYL PRYRGYVAQN GSGELAQAQT RYLQNAAYVR MKNIQFGYNL PRTLIQKVGM SSARVFVSGE NLFSWSPLYK ITRDLDIENI GRSDAVLNPP TNSDPNSNNS GNGNNYPILK SFTMGLSATF
|
| |