Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_2934 |
Symbol | |
ID | 8726685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 3544160 |
End bp | 3547123 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | |
Product | TonB-dependent receptor |
Protein accession | YP_003387746 |
Protein GI | 284037816 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAAACA TGGAAATAAA ATTACGTAAG CATGACATTC TTCCGGTATT GGTTGCCATG TTCATGATGG TCGCTCTATG CCCAATTTCC GCTCTGGCGC AGAGCAAGCG AATAACCGGT AAGGTAGTTT CCAGTGTTAA CTCCGAAATA GTACAGGGAG TCAACGTATT AGTAAAAGGC AACAGCCGAA AGGGTGCCGT AACGGATGGT GAAGGTAAAT TTTCTCTGGA AGCCACACCG AACGACGTGC TTGTTTTCAG TTTCATTGGC TTTAAATCGA AAGAAGTTAA AGTAGGTAGC GAAACCACCT TCAACATTTC GCTGGATGAA GATGCCACCC AGTTGACGGA ATTGATTGTG ACTGGTTCGC GCAATACCGG GCGTACAATT CTGGAAACCC CGGTTCCGGT CGACGTTATT TCGATCAAGG ACATAATGGG CGAGCTTCCG CAAATCGATC TGGCACAAAT GCTGGCTTTT GTCGCACCAA GCTTCAATGC CGTTCGGTCG CAGGGTGGTG ATTTGAACTC CCACGTCGAC CCTGTTCAGT TGCGCAACAT GGCTCCCAAC CAGATCCTTG TGCTGGTAAA CGGAAAGAGA CGGCACACAT CCGCACTCCT TATTACGGAA ACCGCCGTTG GCAGCCCATC TACAACGGTC GATCTGATGA CGATTCCGGT ATCGGCTATC GACCGGGTTG AAATCCTGCG GGATGGTGCC GCTGCGCAGT ATGGCTCTGA CGCCGTTGCG GGTGTGGTCA ATATCATCCT CAAGAAAGGC ACTAACAAAC TAACGGCTAA CCTGACCGGT GGTGGGTATG CCAACACGGG TGGGCAAGCC GGTGCGCTGA CAAAATCGGG TAAACCCGAC GGTTTTAATT ATCAGTTTGA TGCCAACTAC GGCTTCAAAA TTGGCGACAA AGGCTATTTT AACATGTCTG GTCAGATTAC ACAGCGTCGG CCAACACTCC GTCCGTTTGT GAATGACTGG GGCTTTTTCG ATAAAACGTA CCTCAACAAC CTGAGAACCG ACAAAGCGGG CAATCCGGTC ATTACCAACC CTGAATTAAT CAATGCACAG GCGGCAGGTA ACACCTCACA GATTGCCGCA CTAACTACTG AAACGGGGCT AATGACCGCG CGCGGTTTGA CAAAAGCCGA TTTCGCGGTG TATGCCGGTA TGCCCGCCAT TACCCTTGGC AGCACCTTCT ATAACGCAGG GTATGAGATT AACCCAACCA CAACCATCTA CAGCTTTGGT GGTGCATCGT ATAAGTATCT GGAAGGGTTC TCCTGCTATT TCCGCCGACC CGCCCAAACC GACCGATTCA ACTACCTGCT CTACCCGAAC GGTTTCCGGC CTCAGATGAC ATCCAACACT TCCGATGTAT CGAACACCAT TGGTCTCAAG AGCAAAATCG GCGAGTTCAG CGTTGACTTC AGCAATACCT TCGGCCGGAA TACGATGCGA CTTGGCATGG TCAACACCAT GAATGCATCT TTAGGCTCCA ATTCGCCGGT GAACATGAAC CTGGGTACTC ATCAGTTTTC CCAGAACTCG ACTAACCTCG ACATGTCCCG TTACTTTAAA GGCATCATGA ATGGACTGAA CATCGCGTTT GGTGCCGAAA TGCGTATCGA GAACTACAAA ATCATGAAAG GGCAGGAAGA AAGTTACGCC TACGGAACGG CAGGTGTCGT TACCGTTGGA AAAGACGGAC TCCTGGTTGG CCCGGACGGA AAACCGCTGG AGAACGCGAG CAGCGTTCCC ATTGTTGATG CCAACGGAAA CCCGCTGGCA GTAACAGCTG GCCAGCAGGT AACAGTTAAG TCGCTCTCGT CCAATTGCCA GTGCTTTGCC GGTTTCGGCC CAAAAAATGA GCGTAATGAG TTCAGAACGA CAATGGCCGC CTATCTGGAT GCTGAGCTGG AGCTGACCCG GAAATTCCTT GTCGCCGGTG CGTTCCGACT GGAGAATTAC TCTGATTTCG GCGGTGTCAC CATTGGCAAA CTGGCCGCTC GCTATTCGAT CACCAAAACG CTTTCGTTGC GCGGATCGAT TGCCTCTGGT TTCCGGGCTC CTTCGCTACA GGAATTGAAC TATACGCACA CAGCTACCGC TTTCGTCCCG GATAAAAATG GTATTCCCCA GCCGCTTGAT GTAACCACTT ACCCGACCAA CAGTACTGCT GCCCGCGTAT TAGGTATCAA AGGGTTAAAG CAGGAGCAGT CGCGTACTTA TGGGATAGGC CTTACCTACC AGCCGGCACC AGGCTTTGAA GTAACGCTGG ATGCCTACCA GATTGACGTT GATAACCGAA TTTTCCGGAC CAGCTATTTC AACGCATCGG AAGTAGGCAA CAACTACAGT GAGGTAATCG GCGAAGGCGA GGCCCAATTC TTCGTTAATG GAGCCGATGT TCGCTCGAAA GGTCTTGAAG CCGTAGGCAA CTACACGCTC AACTTGCAAA AAGGCAAAAG CCTGACGTTC ACGCTGGCAA CTATTCTCAG CAAAAACACA GTTCTCAACC GGAAAGTCCT TGACCTGAAT GTGGCCAATC TTACGTCGGA GCAGATTGTG GAAAAGTACC TGAGCCGTGA TGTGATCGGG CAGTTTGAAA CAGGCACCCC ACGAACCAAA CTGATTGGAT CAGTAACGTA TCGGGTAAAC AGATTTAACG CTATGCTGCG CGGCACCTAC TTTGGTACCG TAACGGAGCG GTCAGTTTCT TCAGACAACG ACGGCAACTT TTACGACCAG ACCTTCTCTC CCCAGGCCGT TTTTGACCTG AGCTTCGGCT ACGACCTGAA CCGGAACGTG AAAGTATCGA TTGGTGGCAG CAATATATTC GATAAATACC CGCAGATACT TCGTCCAGAG AACCAGGGTT TCTATCTTTA CTCCAACAAT CAGCAGGGGT CCAATGGTGC GTATTATTAT GGCCGTTTAA CCTTCAACTT TTAA
|
Protein sequence | MLNMEIKLRK HDILPVLVAM FMMVALCPIS ALAQSKRITG KVVSSVNSEI VQGVNVLVKG NSRKGAVTDG EGKFSLEATP NDVLVFSFIG FKSKEVKVGS ETTFNISLDE DATQLTELIV TGSRNTGRTI LETPVPVDVI SIKDIMGELP QIDLAQMLAF VAPSFNAVRS QGGDLNSHVD PVQLRNMAPN QILVLVNGKR RHTSALLITE TAVGSPSTTV DLMTIPVSAI DRVEILRDGA AAQYGSDAVA GVVNIILKKG TNKLTANLTG GGYANTGGQA GALTKSGKPD GFNYQFDANY GFKIGDKGYF NMSGQITQRR PTLRPFVNDW GFFDKTYLNN LRTDKAGNPV ITNPELINAQ AAGNTSQIAA LTTETGLMTA RGLTKADFAV YAGMPAITLG STFYNAGYEI NPTTTIYSFG GASYKYLEGF SCYFRRPAQT DRFNYLLYPN GFRPQMTSNT SDVSNTIGLK SKIGEFSVDF SNTFGRNTMR LGMVNTMNAS LGSNSPVNMN LGTHQFSQNS TNLDMSRYFK GIMNGLNIAF GAEMRIENYK IMKGQEESYA YGTAGVVTVG KDGLLVGPDG KPLENASSVP IVDANGNPLA VTAGQQVTVK SLSSNCQCFA GFGPKNERNE FRTTMAAYLD AELELTRKFL VAGAFRLENY SDFGGVTIGK LAARYSITKT LSLRGSIASG FRAPSLQELN YTHTATAFVP DKNGIPQPLD VTTYPTNSTA ARVLGIKGLK QEQSRTYGIG LTYQPAPGFE VTLDAYQIDV DNRIFRTSYF NASEVGNNYS EVIGEGEAQF FVNGADVRSK GLEAVGNYTL NLQKGKSLTF TLATILSKNT VLNRKVLDLN VANLTSEQIV EKYLSRDVIG QFETGTPRTK LIGSVTYRVN RFNAMLRGTY FGTVTERSVS SDNDGNFYDQ TFSPQAVFDL SFGYDLNRNV KVSIGGSNIF DKYPQILRPE NQGFYLYSNN QQGSNGAYYY GRLTFNF
|
| |