Gene Information |
Locus tag | Slin_4333 |
Symbol | |
ID | 8728093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5253154 |
End bp | 5255430 |
Gene Length | 2277 bp |
Protein Length | 758 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | FG-GAP repeat protein |
Protein accession | YP_003389114 |
Protein GI | 284039184 |
COG category | |
COG ID | |
TIGRFAM ID | |
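The coordinate and length fields above agree with each other, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions fit the values in this record but are not stated by it. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Slin_4333.
length = gene_length_bp(5253154, 5255430)
print(length, protein_length_aa(length))
```

For this record the sketch gives 2277 bp and 758 aa, matching the Gene Length and Protein Length fields.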
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00966426 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.30257 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGC TGTTTACGCT TTGTTTTTTA GCAATTCCAT TTCTCACCCA TGCGCAGTCG GGCGGGAAGA TCGCGTTTCG ATACGACCAG AGCCCAACCG TAAGCATTAA TAACCGCCCG CTGCTCAATC CGTGGGTGGG TGGCCTGAAC ACGACGCAGT ATTCAACCAT TCGCCTGAAC AACGACACCC GCGACGATCT GGCCGTTTAC GACCGGACAA CCGGTAAGGT CAGCACGTTT ATCGCCGTCG ACAATCCCAT TGGCAGCGGC ACGGTATGGC AGTACGCGCC CGAGTACGAA CTAGCTTTCC CGGCGGTCAT GTACAGCTGG ATGCTGCTGG TCGATTATGA CTTCGACGGC CGGAAAGACA TTTTCACCAA CAGTTCGAAG GGCATCAGCG TATGGCATAA CGAATCGCAG AACGGGACGG TATCGTTCAA ACTGGCAGTC GATCCGCTCC GAACGCTTGG CTTCAGTGGC TTTCAGATCC CCCTTTACGT CAATGCGTCC GATCTGCCCG CTATTATGGA TTACGACGAC GATGGCGACA TCGACATTAT CACCTTCGAT GCCGATGGCA ACATCATTGC CTACCAGCAA AACATGAGTA TGGAGCGAAC CGGCACGAGA GGACTCGACT TTGCCCGCGC CGGTAATCAG TGCTGGGGGC ATTTTCAAAA AGAGTTTTGC AACGACTTCA GATTTGGGAT CAATTGCGAC GATGGCTCCG GCACCGGTGG GCGTCTGGCC GCTCCAACTG CTACCGGTCG AGTCAGCCCG ACTAGTCCCA GCGGGGCAAG ACCGCTCCAT TCGGGTAACA CCCTCACGGT GATCGATACT GATGGCGATG GTAAGAAAGA CTTGCTGTTC GGATTTGTAA GCTGTGAAAA CATTGCCCGC CTGCGAAACA CCGGGCCCAA TAGCGTAAGT GCTAACTTCA CGAGTTACGA TAGCCTGTTT CCCGCCCGAA ACCCCATTCT GTTTCCGGCC TTTCCGGCCA CTTTTTTTGA GGATGTCGAC GGCGATGGGC AGAAAGATTT GCTCGCATCG CCCAATGTAA ACTTCAACGA CGGCAATGTC TACGATTTTC GGGCGTCGGG CTGGTTTTAC AAGAATACGG GTACTACGCA AAAGCCCGAT TTTCAGCTCA TTCAGAAAGA TTTCCTGCAA AGTGACATGC TCGACCTGGG CGAACGCACG GCCCCCGCGC TGGCCGATCT GGATGGCGAC GGCGATATGG ATTTACTGGT CGGGTACGGC GGGGTAGGGG TTGGCTCCGG CTACCGGGGA GGCATCTGGC AATTCGAGAA TAAGGGGACA ACGCAAAATC CGGCTTTCGT GCTCGTCACA ACCGATTACC TGGGTATACA GTCGCTGGGG CTGACAAACG TGGTGCCTTC TTTTGCCGAT GTGGATGCCA ATGGTAGTAT GGACCTGATC GTCACCGCCA CGGGGAAACA GGCCGTAGAA ATTCGCGCGC TGATCAATAC CGCTCCCAAA GGAGCTGCCG TCCAATACAG CCTTGCCAGT GCCACCCGCT GGCCCACGCC CGATCTGATG TACCCGCTCG ATCTGCTGAC GGTTACGGAT GTCGACAAAG ACAACAAACC GGACTTACTG ATTAGCCGGT ACAACGTGGG TACCATTCTC TATTACCGCA ATGCTGGCAC GGCCACGGCT CCCGTTTTCC AGTTGCAGAA CCAGACGTTC GGTGGGATCA CGACGGACGA TTACATCTAC GCCCGCGCCC GGTCGCTGGT GGTGGCCGAC CTGAACGGCG ATCAGAAAAA CGAACTCATT 
GCCGCAGCCG ATAACGGTTC GGTTAAGGTC TATCAATTTC CCGAAACCCC GACTCAGGCG TTTACGCTGA TCGACTCGCT GGCGGGCATA GGCTTGCCCG GCAAAGGACT TATTGCCGCA GCCGCCGACC TCGACGGCGA CCAACTGCCC GATCTGTTGC TTGGTGGTAC GGGTGGCGGA TTGCGCTATC TGAAAAACAC CTCCCAAAAG ATCGTTGTGA CCGGCCTACC CGAAGAACCG ACCGGCCCAT GGGTGTTCCC CAACCCCACT AACCGGTACA TTACCGTTCG TCCGCACTAC GATGGGCGCG TTGAACTGGT GTCTTTAACG GGCCAGACCG TGGTGCCCGT TCAGCCGGTT AAAGCCGGGA CAGAAAGTCT CCTTGATTTA GGTGAGCTGG CCGATGGAAC GTATCTGATC CGACTCCAGA GCGATAACCG CCCGGTACAA ATTCAGAAAG TGGTGGTCTG GAAATAA
|
Protein sequence | MKKLFTLCFL AIPFLTHAQS GGKIAFRYDQ SPTVSINNRP LLNPWVGGLN TTQYSTIRLN NDTRDDLAVY DRTTGKVSTF IAVDNPIGSG TVWQYAPEYE LAFPAVMYSW MLLVDYDFDG RKDIFTNSSK GISVWHNESQ NGTVSFKLAV DPLRTLGFSG FQIPLYVNAS DLPAIMDYDD DGDIDIITFD ADGNIIAYQQ NMSMERTGTR GLDFARAGNQ CWGHFQKEFC NDFRFGINCD DGSGTGGRLA APTATGRVSP TSPSGARPLH SGNTLTVIDT DGDGKKDLLF GFVSCENIAR LRNTGPNSVS ANFTSYDSLF PARNPILFPA FPATFFEDVD GDGQKDLLAS PNVNFNDGNV YDFRASGWFY KNTGTTQKPD FQLIQKDFLQ SDMLDLGERT APALADLDGD GDMDLLVGYG GVGVGSGYRG GIWQFENKGT TQNPAFVLVT TDYLGIQSLG LTNVVPSFAD VDANGSMDLI VTATGKQAVE IRALINTAPK GAAVQYSLAS ATRWPTPDLM YPLDLLTVTD VDKDNKPDLL ISRYNVGTIL YYRNAGTATA PVFQLQNQTF GGITTDDYIY ARARSLVVAD LNGDQKNELI AAADNGSVKV YQFPETPTQA FTLIDSLAGI GLPGKGLIAA AADLDGDQLP DLLLGGTGGG LRYLKNTSQK IVVTGLPEEP TGPWVFPNPT NRYITVRPHY DGRVELVSLT GQTVVPVQPV KAGTESLLDL GELADGTYLI RLQSDNRPVQ IQKVVVWK
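The gene and protein sequences are displayed in blocks of ten characters, so the spaces have to be removed before any comparison. The sketch below shows one way to do that and to translate a fragment, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helper names are hypothetical, and only the first and last few codons of the gene are used here rather than the full sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First seven codons and last six codons, copied from the gene sequence.
head = "ATGAAAAAGC TGTTTACGCT T"
tail = "GTGGTGGTCTG GAAATAA"
print(translate(head), translate(tail))
```

The translated fragments can be compared with the start and end of the protein sequence in the record; the trailing `*` corresponds to the stop codon, which is not part of the 758 aa protein.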