Gene Information |
Locus tag | Slin_6575 |
Symbol | |
ID | 8730361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 7983333 |
End bp | 7984469 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | thiazole biosynthesis protein ThiH |
Protein accession | YP_003391331 |
Protein GI | 284041401 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTACAA CAACCTGGTT TGACGCCTAC GACTGGAACG CCGTTCAGCA GGATATTCAG GCCGTTACTC CGCAACAGGT GGAACGGGCA CTGGGCAATA CCCGCCGGAC GCTCGGTGAT TTTAAAGCAC TGATCTCACC GGCGGCTCAG GCGTATCTGG AACCGATGGC CCAGCTTAGC CACCAGCTGA CGCAGAAACG GTTCGGCAAG ACGATCCAGC TGTATGCGCC CCTGTATCTG TCCAACGAAT GTCAGAACAT CTGCACCTAC TGCGCCTTTA GTCTCGACAA TAAAATCGTC CGCAAAACCC TTACCGATGC TGAAATAGGG CGGGAGGTCA ATGCCCTGAA GCAACTGGGC TATGAGCATG TACTGCTGGT GACCGGTGAG GCAAATCAGA CTGTTGGGGT GCCGTACCTT CGAAACGCTA TTCGTCTGCT ACAGCCGCAC TTCGCCCATA TTTCCATGGA GGTGCAGCCC CTCGACCAGC CCGAGTACGA AGAACTAATC GCCGAAGGGC TCAACACGGT GCTGGTGTAT CAGGAAACGT ACCATCGGGA AACGTATAAA AACCATCACC CGAAGGGTAA GAAATCGAAT TTCGCCTACC GGCTCGATAC TCCCGACCGG CTGGGAAGGG CGGGCGTGCA TAAGATGGGC CTGGGTGCGT TGCTCGGGCT GGAAGACTGG CGTACGGATA GCTTTTTCAC GGCGGCTCAT CTGCACTACC TCGAACGCAA GTACTGGCAG ACCAAATACA GCATCTCCTT CCCGCGTCTC CGCCCCATCG ACTTATTGCA GGATGACACC ATCACCTCCA AATCGTTCGA GCGGTGCATG TCTGATCGGG ATCTGGTCCA ACTGATCTGC GCTTACCGGC TGTTCAACGA GGAGGTCGAA CTCTCCCTTT CAACCCGCGA AACGCCCCGC TTCCGCGATC ATGTTCTCAA GTTGGGTGTC ACCAGCCTGA GTGCGGGGTC CAAAACCAAT CCAGGTGGCT ATGCCGTTGA GCCGCAGTCG CTGGAGCAGT TCACCATTTC GGACGAACGG AGCCCCGCCG AGATGGTCGA CGTTATTCGG AGGCAGGGAT ACCAGCCGGT TTGGAAAGAC TGGGATAAAA CGCTCGCTGC CTTATGA
Protein sequence | MSTTTWFDAY DWNAVQQDIQ AVTPQQVERA LGNTRRTLGD FKALISPAAQ AYLEPMAQLS HQLTQKRFGK TIQLYAPLYL SNECQNICTY CAFSLDNKIV RKTLTDAEIG REVNALKQLG YEHVLLVTGE ANQTVGVPYL RNAIRLLQPH FAHISMEVQP LDQPEYEELI AEGLNTVLVY QETYHRETYK NHHPKGKKSN FAYRLDTPDR LGRAGVHKMG LGALLGLEDW RTDSFFTAAH LHYLERKYWQ TKYSISFPRL RPIDLLQDDT ITSKSFERCM SDRDLVQLIC AYRLFNEEVE LSLSTRETPR FRDHVLKLGV TSLSAGSKTN PGGYAVEPQS LEQFTISDER SPAEMVDVIR RQGYQPVWKD WDKTLAAL
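The length fields in this record follow from the coordinates: End bp minus Start bp plus one gives the gene length, and a complete CDS of that length codes for one residue per codon, less the stop codon. The following is a minimal Python sketch of those checks, together with a helper that strips the ten-base display grouping and a GC percentage calculation. The function names are hypothetical and not part of any database API, and the sketch assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used to display the sequence in blocks of ten."""
    return "".join(grouped.split())


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungrouped nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Values copied from the Start bp and End bp fields of this record.
    length = gene_length(7983333, 7984469)
    print(length)                  # 1137, matching the Gene Length field
    print(protein_length(length))  # 378, matching the Protein Length field

    # First two display blocks of the gene sequence, used only as a short demo.
    head = clean_sequence("ATGAGTACAA CAACCTGGTT")
    print(len(head), gc_percent(head))
```

Passing the full Gene sequence value through clean_sequence and then gc_percent gives a figure that can be compared with the GC content field. That field is shown as a whole percentage, so a small difference from rounding is to be expected.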