Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3853 |
Symbol | |
ID | 8727611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4624016 |
End bp | 4625206 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003388642 |
Protein GI | 284038712 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTATTG CCCTTAGCTT ATTAACTCTA CTTTTCTTTT TGACGGCTGT TGAAACGATC TACCTGCTCG TTTTCACGAT GGCCGGCCGG TTCGGACACC TGCAAGCCCC TCCGGTAAAT CCAAATCCTG TCGCCAAGCG GATGGCCGTG CTGATTCCGG CCTATAAGGA AGATGCCGTT ATTACCGACA CAGCCGAACA GGCCCTCAAA CAGGATTATC CGCGTGATGC CTACGATGTT ATTGTCATTG CCGATTCACT CCAGCAAGAT ACGCTCGACA AACTAGCCGC CCTGCCTATT ACGGTTGTTG AGGTAAGTTT TGACGTTTCG ACGAAAGCCA AGGCGCTGAA TGCGACCATG AGTAAGCTCA GTGCCGCGTA CGATGTGGCC GTCATTCTGG ACGCCGACAA TGTGATGGCC ACCAACTTCC TGACCCATAT CAATGCCGCA TTCAACGGCG GCTGGAAAGC GGTACAAGGC CACCGGGTAG CTAAAAACAC CAATACCAGC GTGGCTATTC TGGATGCCGT CAGTGAAGAA ATCAACACCT ACATACTCCG TCGTGGCCAC CGGGCCCTGG GGCTGTCGGG TAGTTTGATG GGCTCCGGTA TGGCGTTTGA CTATACCTTA TTCAAGCAGT ATATGGGTCA GATTAACACT ACAGGCGGAT TCGATAAAGA ACTGGAAATG CGGCTCATCC ACGACCACCA CCGGATCGAC TACATCGATG AGGCCTTGTG TTACGACGAG AAAGTACAAA GCGGGGCCGT ATTTGAACGG CAACGGGCCC GGTGGATTGC CGCTCAGCTC AAGTACCTGC GCCGTAACCT GCCCTCGGGC ATCGTTCAGT TACTGAAAGG CAATCTGGAT TACTTCGATA AGGTTTTCCA GACCATGTTT TTACCCCGGG TAATTCTGCT GGGCTTCCTC ACCATTGGTA CGGGTGCAGC GCTTGTCTTG CAGGATTCCA ACCTGCTTAT GCTCGCGGGA GGGCAGCTCC TGATTCTCTT ACTTACCTTT TACATCGCCA CGCCGAATGA GCTGCTGGCC CTGATCACCT GGAAAGAAAT AAGCCAGATT CCCGGTTTGT TCTTCCGCTT TTTACGGTCG ATAACCCGCC TGGGCGAAGC AAGCAAGAAG TTTATCAATA CCCCTCATTC AACCACAAGT ACAGCCATAA ACGAAGGATG A
|
Protein sequence | MTIALSLLTL LFFLTAVETI YLLVFTMAGR FGHLQAPPVN PNPVAKRMAV LIPAYKEDAV ITDTAEQALK QDYPRDAYDV IVIADSLQQD TLDKLAALPI TVVEVSFDVS TKAKALNATM SKLSAAYDVA VILDADNVMA TNFLTHINAA FNGGWKAVQG HRVAKNTNTS VAILDAVSEE INTYILRRGH RALGLSGSLM GSGMAFDYTL FKQYMGQINT TGGFDKELEM RLIHDHHRID YIDEALCYDE KVQSGAVFER QRARWIAAQL KYLRRNLPSG IVQLLKGNLD YFDKVFQTMF LPRVILLGFL TIGTGAALVL QDSNLLMLAG GQLLILLLTF YIATPNELLA LITWKEISQI PGLFFRFLRS ITRLGEASKK FINTPHSTTS TAINEG
|
| |