Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3393 |
Symbol | |
ID | 8727146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4101109 |
End bp | 4102161 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_003388200 |
Protein GI | 284038270 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.179766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATT ATCTGTTTAT ACTGCTGGCG CTGGTGATCG CCTGCCAGTC GGCTGACAAG CCAAACCCCA CGCCTACCCC ACCCGTCACG TCGGTCGACA CGGTCGGCAA GTTCCAGAAT CCGTTGCTCA CCGCAGCTCC CGACCCCTGG GTGACGTATA AAGATGGCTT CTATTACGTG ATGCACACCA CCGGAAACGA TCTGCGCATC TATAAAACAG CCGCCATGAG TCGTCTGGGT TCGGCACGTC CGGTAACCGT CTGGACACCG CCCACCACCG GCCCCAACTC CCGCGACATC TGGGCACCGG AACTCCACTT CGTCAACGGC AAATGGTACA TCTACTACGC TGCCGACAAC AACGGCAACG ACGCGACCCA CCGCATGTTT GTGCTAGAAA ACGAATCCGC CGACCCGACA CAGGGAACGT GGGTCGACCG GGGGCAGCTC AATTTACCCG AAAACCGATG GGCAATTGAC GGTACACTAC TTCAGCAAAA TGGTCAGTTG TATTTCGCCT GGTCGGGTTG GCAAACGGAT CAGGGCGGTT CACAGCACAT TTACATGGCC AAACTATCGA ATCCGTACAC CGTCGAAAGT ACGGCTCGAA TTCGCCTGTC ATCGCCTGAT TATGCCTGGG AGAAAAACGG TTTTGGCGTG AATGAAGGCC CCGAGTTTCT GGTGCATAAC AACAAAGTCT TTCTGATTTA TTCGGCCAGC TTCTGCGGAA CGGATCTGTA CGCACTGGGC CAGCTCTCCG CCGATACCAG CGCAGATTTG CTCAACCCGG CATCCTGGAC GAAATCAGCT ACGCCAGTTT TTGGTCCCAA CGGCGGCAAC GGCACCTTTG GCGTGGGTCA CAACAGTTTT TTCACTACCC CCGACGGAAA GGAAAACTGG CTCGTTTACC ACGCTAATGC CGCTTCAGGA CAGGGCTGTG GCAATAACCG CTCCATCCGC ATGCAGCCTT TTACGTGGAA AGCCGACGGG TCGCCCGATT TTGGCAGTGC AGCTCCAATA GCGCAATGGA TTAAGCGGCC TGGTGGCGAG TGA
|
Protein sequence | MKNYLFILLA LVIACQSADK PNPTPTPPVT SVDTVGKFQN PLLTAAPDPW VTYKDGFYYV MHTTGNDLRI YKTAAMSRLG SARPVTVWTP PTTGPNSRDI WAPELHFVNG KWYIYYAADN NGNDATHRMF VLENESADPT QGTWVDRGQL NLPENRWAID GTLLQQNGQL YFAWSGWQTD QGGSQHIYMA KLSNPYTVES TARIRLSSPD YAWEKNGFGV NEGPEFLVHN NKVFLIYSAS FCGTDLYALG QLSADTSADL LNPASWTKSA TPVFGPNGGN GTFGVGHNSF FTTPDGKENW LVYHANAASG QGCGNNRSIR MQPFTWKADG SPDFGSAAPI AQWIKRPGGE
|
| |