Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_5004 |
Symbol | |
ID | 8728768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 6095780 |
End bp | 6098812 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003389780 |
Protein GI | 284039850 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGAA ATACAGTAAG CCGTTTTGTA TTGCTGGTAC TGATCGTTTG TTTCTCCGGA TTCGTGGCAG ACCCGCTGGA GGGTGCATGG GACTGTTCGA TCAATCTGGA CGGGAAACTG GTCGGAACCC TGAAGGTAAG CCGGGTGAAT GCGTCGTATA CCGGCGTGTT CTTCTCCTTT GAACGGGGAG AAATACCCGT CCACCATGTA ATGGTTGATG GAAACAAACT GTCCGGGGAT TTTACGCTGA GGAAAGAAGT ATACCAGTTG GTGGCTGAGG CAACGGGAAA TACGCTGACG GGCAGTGTTG TGTCGGGTGC CAAACGGTTT ACGTTGCTGG CTAATCGAAT CACCGAACCC GCACAGGCGT ATGTGCCTCC TCGGGTCTCT TACATTCTGG CGGATAAGGA ACTGAATGGG TTTGAGCAGG ATATTGACCA TGCGGGTCTG ATTGAACGGT TCGACCGGGA GGGTTATACA CGGGGCGAGC GGATTTACAA TGCGGTTTGT ATCAATTGCC ACGGCAATCC GGAGTCGGAA GGGTCGATTC CGATGTCGCA CAAGTTCTGG AAGCAGCCGT TCAAGGCCGG TTACGACCCG TTTTCAATGT ACCAGACCAT CACCAAAGGG TACGGGTCTA TGCCGCCCCA GATGACGCTC AGCCCGCAGG AGAAGTATGA CGTAATTTCG TATATCCGGG AGAATTTCGT CAAGGAAAGT AACTCCAGCC AATACCTCCG CGTAACGCCC GGTTACCTCG CCAGTCTGCC GAAGGGAACG TCAAGAGGCC CGGCGGTAAA ACCCTATCAT CCCTGGTCGG ACATGGACTA TGGGAGTTTC TTTATCAATA CGTACGAACT GGCCGATTCG GCTACAGGCC CGAAACGGTT CCATTCGCCG GGACGCCCTC CATATGCTGA CGAAGATTAT CGGAAAAATA ATTTTGCCTA TAAAGGCATT GCCGTTCGGG TCGATACCGG GGCTGGAGGT GTGTCGGCCG GGAAGGCCTG GATGATGTTT GACCACGACG TCATGCGCGT AGCCGGTGGC TGGACGGGCA GGGGATTTAT TGACTGGGAG GGAATTCTGC TCAACGACAA CCACGAAACC TACCCCCGAA CCATTGGCCA GCTACATTTC GAAACGCCCG TTGGCCCCGG CTGGGCAAAC CCCGTTACCG GCTCGTTCAA CGACCCCCGA TTTACGGCGC GGGACGGTCG TCAGTTTGGG CCGTTGCCCA AAACGTGGGC CAACTACAAA GGGCTGTATC ATCATGGAAA CACGGTAATT ATTGCCTACA CCGTCGGCCA GGCGGCTATT CTGGAAAAAT TGGGGATGGA AACGGTTGAG GGCCAACCGG TTTTTACACG AACCCTGACC ATTAATCCGG GTCAGCAACT GCTGAAAACC AGAGTGGCGC GGGTCGGCAC CAACGTCGCG GTCGTTGGGA AAGGAGCATC GCTGACGGAG GAAAACGGAT ACGTTGTTTT GCAGATTACG GCCTCGGCAA AAACCACCAT CAAACTGTTG ATGGCCCGGC CCGGTAGCAA AAGCCTGACC GCTCTGGCGG CCCGTTCACC AACGCCCGAA TCGCTCGAAA AATACACCAA AGGCGGGAAG GCGCATTACC CACAAACGCT GTACACGACC ATTACGCGGG GAAAAGACGA CGGCTTGTTT GCGGTGGATC AACTGTCGCC ACCCTACGAC AATCCGTGGA AATCGCGGAT GAAGCTCAGC GGTATCGACT TCATGAAGAA CCCCAATGTG GCGGTTTTAT GCACCACCGA TGGCGACGTG TGGCGGGTGA CGGGCCTGAC CGATGCCAGC GGGCGTCTTG GCTGGAAGCG CATTGCTTCG GGGCTTTTTC AGCCGCTGGG TATCAAAGTG CTGAACGAAA AAATATACGT TACCTGCCGC GACCAGCTGG TGCTACTTCG TGATCTGAAC GGCGACGGCG AAACCGATTT TTACGAAAGC TTCAACCACG ACCATCAGGT GACGGATCAT TTTCACGAAT TCGCGATGGG CTTGCAGGCT GATAAAGAAG GCAATCTGTA TTATGCCAAA AGTGGTCGTC ATGCCCGCGA AGCCCTGATT CCCCAGCACG GAACCCTGCT GAAAGTCAGC AAGGATGGGG CCACCACGCA GATCATGGCT ACCGGGTTCC GAGCTGCCAA CGGCGTGTGC ATCAACCCCG ACGGTAGTTT TATCGTGACC GACCAGCAGG GCTACTGGAA CCCTATGAAC CGCGTCAACT GGATTGAGGG CAAGGGGAGG TTCTACGGTA ATATGTGGGG CTACAACCCG CCCAACGACT CCACCAAAGC CGCTATGGAG CAGCCGCTGG TTTGGATTGA TATGGAGTTC GACCGGTCGC CGTCGGAGTT GCTGTGGGTG GACAGTAAGA AATGGGGGCC ACTGAACGGC AGCCTGCTCA GTTTTTCGTA CGGCTATGGC AAGATTCAAC TCGTGCTACC CGAAGACGTG AATGGCCAGA AGCAGGGTGG CGTGATCGAC CTGCCGGGGG TTAAATTCCG AACCGGCATC ATGCGCGGTC GCTTCAACCC CGCCGATGGG CAGCTATACG CCTGTGGCAT GTCGGCCTGG GGAACGTCGC AAACGATTAA AGGTGGCGAT TTCTATCGGC TTCGCTATAC CGGAAAATCC CTCCCGCTCC CCACCCGGCT GAGTGCCGAA ACGGATGGTG TTACATTGAC ATTTGCTCGC GAGCTGGCCG AAAACAGCAC CCGGAATCCT GCCAGTTTTG TGGTGAAAAG CTGGGACATT CGGAGGAGCC GGAAGTATGG TTCGGACCGC TACAATATAC AAACATTGAC CGTCGCGGAT GTGAAACTAA GCCGGGATAA ACGAACCGTA AAACTCCTAC TTCCCGGCAT AAAACCCGTC GATGTCATGA CGATTGCCTA CACCGTTGCC GATACGAAAG GGCAGACGTT TACGGGCACC GTACAGAACA CGATTCATGC ATTACGAAAA GCCAGTAATC AGGTTTTTAC TGCCGTCAGG TAA
|
Protein sequence | MTRNTVSRFV LLVLIVCFSG FVADPLEGAW DCSINLDGKL VGTLKVSRVN ASYTGVFFSF ERGEIPVHHV MVDGNKLSGD FTLRKEVYQL VAEATGNTLT GSVVSGAKRF TLLANRITEP AQAYVPPRVS YILADKELNG FEQDIDHAGL IERFDREGYT RGERIYNAVC INCHGNPESE GSIPMSHKFW KQPFKAGYDP FSMYQTITKG YGSMPPQMTL SPQEKYDVIS YIRENFVKES NSSQYLRVTP GYLASLPKGT SRGPAVKPYH PWSDMDYGSF FINTYELADS ATGPKRFHSP GRPPYADEDY RKNNFAYKGI AVRVDTGAGG VSAGKAWMMF DHDVMRVAGG WTGRGFIDWE GILLNDNHET YPRTIGQLHF ETPVGPGWAN PVTGSFNDPR FTARDGRQFG PLPKTWANYK GLYHHGNTVI IAYTVGQAAI LEKLGMETVE GQPVFTRTLT INPGQQLLKT RVARVGTNVA VVGKGASLTE ENGYVVLQIT ASAKTTIKLL MARPGSKSLT ALAARSPTPE SLEKYTKGGK AHYPQTLYTT ITRGKDDGLF AVDQLSPPYD NPWKSRMKLS GIDFMKNPNV AVLCTTDGDV WRVTGLTDAS GRLGWKRIAS GLFQPLGIKV LNEKIYVTCR DQLVLLRDLN GDGETDFYES FNHDHQVTDH FHEFAMGLQA DKEGNLYYAK SGRHAREALI PQHGTLLKVS KDGATTQIMA TGFRAANGVC INPDGSFIVT DQQGYWNPMN RVNWIEGKGR FYGNMWGYNP PNDSTKAAME QPLVWIDMEF DRSPSELLWV DSKKWGPLNG SLLSFSYGYG KIQLVLPEDV NGQKQGGVID LPGVKFRTGI MRGRFNPADG QLYACGMSAW GTSQTIKGGD FYRLRYTGKS LPLPTRLSAE TDGVTLTFAR ELAENSTRNP ASFVVKSWDI RRSRKYGSDR YNIQTLTVAD VKLSRDKRTV KLLLPGIKPV DVMTIAYTVA DTKGQTFTGT VQNTIHALRK ASNQVFTAVR
|
| |