Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3972 |
Symbol | |
ID | 8727730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4770633 |
End bp | 4772420 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003388761 |
Protein GI | 284038831 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000648751 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCTAT CTCACCCGAC CGGGACGGTT CGACGCATCG ACGGTCGGCA ATACTTACCA TACATGAATC GACGTAATTT TATTGAACAA CTATCCGTAA GCAGTGGAGC CGCGTTAACC GCTCCACTAC TTACCTTTTC GGGAGAATCA CACGCCCAGG CCAGTGGGCT GCTGAACCGG CTCGGGCAGG CCGAACGAGC CGCCGATGTC GTCATTGCCG GTGGTGGGCT GGGGGGCTGT GCGGCTGCTT TAGCCGCTTT GCGAAACCGC CTGACGGTCA TTATGACCGA AGAAACCGAC TGGATTGGCG GGCAGATGAC CCAGCAGGGC GTGCCACCCG ATGAACACCA ATGGATAGAA ACCCACGGGG CTACCCAGCT TTACCGCGAT CTGCGCACGG GTATTCGGGA TTATTACAAG CAGCATTATC CGCTGACGGA TGCTGCCAAA GCGAGCAAAT TCCTGAATCC AGGCGATGGA GCCGTATCGC GCCTGTGTCA TGAGCCGAAG GTTGCCCTGG CGGTTTTACA GCAGATGATG GCGCCGTACC AAAGTTCCGG CCAGTTGACG TTGCTGCTCG AACACAAGAT CACCTCGGCC GATGTTCAGG GCGATAAAGT TCGTGCGCTG AAAGCCATCA GCCAGCGAAC GGGGAAAGAA ACGGTGCTAA CGGCCCCCTA TTTCGTCGAT GCCACCGAAC TCGGCGACCT GTTGCCCCTC ACCGGTACCG AGTTCGTTAC GGGTGCCGAA GCCCGTTCCG AAACGCGCGA ACTCCACGCC CCCGACAAAG CCGACCCCAA CAACTGCCAG GCGTTTACGG TCTGCTTTGC GATGGATTAC ATAGCGGGGG CGAATCACGT TATCGATAAG CCCAAAGACT ACGCTTTCTG GCGGAACTAC TCGCCCAAGG TTACGCCCGC ATGGTCGGGG AAACTGCTCG ACCTGTCGTA TTCGAACCCG AAAACACTGG AGCCCAAACA GCTTGGTTTC CACCCGGAAG GCATTGCCAC CGGCGATAAA CTGAACCTCT GGAATTACCG TCGGGTGATC AGCAAGGCCA ATTTCAAACC CGGAACGTAT GCTGGCGATG TGTCGGGCGT GAACTGGCCC CAGAACGATT ACAACGCCGG AAACCTGATT GGAGCCAGCG AGAAAGATTT TAAGAAATAC GTTGAGCAGG CCAAGCAACT GAGTCTGTCG TTGCTCTACT GGCTCCAGAC CGAAGCCCCC CGCCCCGACG GCGGACAGGG CTGGCCGGGC ATCCGATTTC GGCCGGATGT GATGGGTAGC GAAGACGGGC TGGCCAAATA CCCATATGTT CGGGAATCCC GCCGAATCAA AGCCGTCTTT ACCGTTCTCG AAGAGCATGT CGGTGCCGAA AACCGGGCGA TGATCACGGG TAAAAAAGAG GGCAACACCT CAGCCGATTT TCCGGATAGT GTGGGTGTGG GCTATTACCA CATTGATCTG CACCCCAGTA CCGGCGGCAA CAATTATATT GACTTTAGCT CCATGCCGTT TCAGATTCCG CTGGGGGCTC TGCTGCCTAA ACGTATGGAA AATCTGTTAC CCGCCAACAA AAACATCGGC ACTACGCACA TCACCAACGG CTGTTACCGG CTGCACCCCG TCGAATGGAG CATTGGCGAA GCGGTAGGGA TGCTGGTCGC CTATTCCTTC AACAAAAAAG TAATCCCCCG TGCAGTTCGG GAGAAGGAAC AACTTTTGGG TGATTTTCAG AAGCTGATTC GTTCGCAGGG TATAGAAACG AATTGGCCTA AAGCGTAG
|
Protein sequence | MTLSHPTGTV RRIDGRQYLP YMNRRNFIEQ LSVSSGAALT APLLTFSGES HAQASGLLNR LGQAERAADV VIAGGGLGGC AAALAALRNR LTVIMTEETD WIGGQMTQQG VPPDEHQWIE THGATQLYRD LRTGIRDYYK QHYPLTDAAK ASKFLNPGDG AVSRLCHEPK VALAVLQQMM APYQSSGQLT LLLEHKITSA DVQGDKVRAL KAISQRTGKE TVLTAPYFVD ATELGDLLPL TGTEFVTGAE ARSETRELHA PDKADPNNCQ AFTVCFAMDY IAGANHVIDK PKDYAFWRNY SPKVTPAWSG KLLDLSYSNP KTLEPKQLGF HPEGIATGDK LNLWNYRRVI SKANFKPGTY AGDVSGVNWP QNDYNAGNLI GASEKDFKKY VEQAKQLSLS LLYWLQTEAP RPDGGQGWPG IRFRPDVMGS EDGLAKYPYV RESRRIKAVF TVLEEHVGAE NRAMITGKKE GNTSADFPDS VGVGYYHIDL HPSTGGNNYI DFSSMPFQIP LGALLPKRME NLLPANKNIG TTHITNGCYR LHPVEWSIGE AVGMLVAYSF NKKVIPRAVR EKEQLLGDFQ KLIRSQGIET NWPKA
|
| |