Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4978 |
Symbol | |
ID | 8728742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 6064862 |
End bp | 6066592 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | RagB/SusD domain protein |
Protein accession | YP_003389755 |
Protein GI | 284039825 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000000462292 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAAAC TACTAAGTAT ACTGGCTATT GCCCTTGGTT TGTCGGCCTG CGACCTGAAC ATGCTGCCTC AGGACGCTAT TTCGCCCAAT ACGTTTTTCA ATACCGAAAA CGACCTGCTG CTGTATACAA ACTCTTTCTA TAACGCCCTG CCATCGGCCG AGGATGTGTA TAATGAAGAC GTCGACAATG TGGTCAAGAA CAGCCTGCGC GATGAGTTGC AGGGTACGCG GGTAGTGCCT ACCAGCGGGG GCGGCTGGAG CTGGGGCACC CTGCGTAATA TCAATTATTT TCTGGCCAAT TCGGGGAAGT GCCCGGATGC CAAAGCCGTG GCCAAATACA ATGGACTGGC CCGCTTTTTC CGTGCCTACT TTTACTTCGG CATGGTGAAA CGCTTTGGTG ACGTTCCCTG GTATTCGAAA CCCATTGAGG TGATGGACCA GGAAATGCTA ACCAAACCGC GCGACCCCCG CACGATGGTG ATGGATTCGG TGATGGCCGA TATTAATTAT GCCATTGCCA ATCTCGATGC ATCGCGGCAG GTAACGACCG TTACCAAATG GACGGCGCTG GCCCTGAAAT CCAGAATCGG TCTGTACGAA GGCACCTTCC GGAAATACCA TACCGAATTC GGTCTGCCCA ACGCCGATAA ATTTCTGGAC GAGAGCATTG CGGCCTCGGC TGATTTGATG AAGAACAGCG GGTACACGAT TTACAAAGCC ACGCCCGCCA CGGCCTACGG TAAGTTATTC TCGTCCGATA ACGCCATTCC CGATGAGGTG ATTCTTGCCC GTGATTTCAG CGATGAGTTA CAGGTGTACC ATAACCTGAA TTACTACACT ATGACGGCTT CGTACGGAAA GCCGGGCCTG GAGAAGAAGC TGGTGAATAG CTACCTGATG GCCGACGGAA CCCGCTTCAC CGACATCAAA GGCTATGAAA CTATGCAGTT TGCGGAGGAG GTCCAGAACC GTGACCCTCG TTTGTCGCAG ACTATCCGCA CACCCGGCTA CACCCGTATT GGCGAAACGA CGCCACTAGT ACCGGAGTTT GGTGCTACGG TTACCGGCTA TCAGCTGATC AAGTTCGTTT CGGCACCCGA GTGGGATACG TTTACCAAAG ACATTACGGA TATGCCCATT TTCCGGTATG CCGAGGTGCT GCTGAACTAT GCCGAAGCCA AAGCCGAGCG GGGTACGCTG ACCCAGGCCG ACCTCGATCT GTCGACCAAA CTCACCCGCG ATCGGGTGGG GATGCCGAAC ATCAATCTGG CCGATGCCAA CGCCAACCCC GACCCGTATC AGGCGCAGCA ATACACGCAG TTAAAGGGAG CCAACATGGG GGTAATTCTG GAAATACGGC GCGAACGGCG CGTGGAACTG GTGATGGAAA ATTTCTTCCG CTGGGACGAT ATCATCCGCT GGAAAGAGGG GCAACTGCTC ACCAAGACAT TTAAGGGTAT GTATTTCCCG GGTCCGGGCA GCTATGATCT GGACAAAAAC GGCAAAGTCG ATCTGGTTAT CTACGAGGGC ACTAAACCAT CGGTGCCGGG CGCTCAACTC CTCAAGCTGG GCAGCGAAAT CCTGCTGGAG AATGGTAACA AAGGCGGTAA CATTGTTGTA AACGGGCATA TCAACAAGAA ATTCAACGAG TCCCGCGATT ACCTATACCC CATTCCAACG CAGGAGCGGC TGTTGAATAC CAAATTGACC CAGAATCCGA ACTGGGAGTA G
|
Protein sequence | MKKLLSILAI ALGLSACDLN MLPQDAISPN TFFNTENDLL LYTNSFYNAL PSAEDVYNED VDNVVKNSLR DELQGTRVVP TSGGGWSWGT LRNINYFLAN SGKCPDAKAV AKYNGLARFF RAYFYFGMVK RFGDVPWYSK PIEVMDQEML TKPRDPRTMV MDSVMADINY AIANLDASRQ VTTVTKWTAL ALKSRIGLYE GTFRKYHTEF GLPNADKFLD ESIAASADLM KNSGYTIYKA TPATAYGKLF SSDNAIPDEV ILARDFSDEL QVYHNLNYYT MTASYGKPGL EKKLVNSYLM ADGTRFTDIK GYETMQFAEE VQNRDPRLSQ TIRTPGYTRI GETTPLVPEF GATVTGYQLI KFVSAPEWDT FTKDITDMPI FRYAEVLLNY AEAKAERGTL TQADLDLSTK LTRDRVGMPN INLADANANP DPYQAQQYTQ LKGANMGVIL EIRRERRVEL VMENFFRWDD IIRWKEGQLL TKTFKGMYFP GPGSYDLDKN GKVDLVIYEG TKPSVPGAQL LKLGSEILLE NGNKGGNIVV NGHINKKFNE SRDYLYPIPT QERLLNTKLT QNPNWE
|
| |