Gene Information |
Locus tag | TK90_1929 |
Symbol | |
ID | 8807702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 2051501 |
End bp | 2052691 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | O-succinylhomoserine sulfhydrylase |
Protein accession | YP_003461156 |
Protein GI | 289209090 |
COG category | |
COG ID | |
TIGRFAM ID | |
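The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are both inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2051501
end_bp = 2052691
gene_length_bp = 1191
protein_length_aa = 396


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming start and end are both inclusive."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)                # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)   # True
```

Under these assumptions, 2052691 - 2051501 + 1 = 1191 bp, and 1191 / 3 - 1 = 396 aa, which agrees with the reported values.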
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT TCGACCCCAG CGAATACGAC CCGCAGACCG TGGCCATCCG GCACGGCTAC CAGCGCACGC CCGAGGGCGA GCACTCCGAG GCGATCTTCC CGACCTCGAG CTTCGTGTTC GGCTCGGCCG CGGAGGCCGC GGCGCGCTTC GGTGGCGAGA GCCCGGGCAA CATCTACTCG CGCTTCACCA ACCCGACCGT GCGTACCTTC CAGGACCGCC TGGCGGCGCT GGAAGGGGGT GAGGCCTGTG TTGCCACCGC CTCCGGGATG TCCGCGATCC TCGCCACCAT GATGGGCCTG CTGAAGGCCG GCGATCACGT GGTCTGCTCC CGCTCGGTGT TCGGCACCAC CACCGTGTTG TTCAACAACT ACCTCGGGCG TTTCGGCGTG GAGGTCACCT ACGTCGAGCT GTCGAACCTC GAGGCCTGGG AGGCCGCGAC CCAGCCGAAC ACCCGGCTGT ATTTCTGCGA GACGCCGTCC AACCCGCTGG GCGAGGTGGT GGATATCCGC GCGCTGGCGG ACATCGCGCA TCGCCACGAC GTAGTGCTGG CGGTGGACAA CTGTTTCTGC ACGCCGGCCC TGCAACGCCC GCTGGATCTG GGCGCCGACA TCATCATCCA CTCGGCAACC AAGTTCCTCG ACGGCCAGGG CCGCTGCCTG GGCGGCGCGG TCGTGGGCGA CGCCGAGCGT GTGGGCGAGG AGGTCTACGG CTTCCTGCGC ACGGCTGGCC CCACGCTCTC GCCGTTCAAC GCCTGGGTCT TCCTGAAGGG CCTGGAGACC CTGGCCCTGC GGATGCGCGC GCACAGTGAC AATGCGCTGG CCCTGGCGCA GTGGCTGCAG GCCCACCCGA AGGTCACGGC GGTGCATTAC CCGGGTCTGC CGGATCACCC CCAGCACGCA ATCGCGAAGC AGCAGCAAAG CGGTTTCGGC GGGGTGCTGA GCTTCGAGGT CGCCGGTGGG CGCGAGGCTG CCTGGTCGGT GATCGACGCC ACGCAGTTCC TGTCGATCAC GGCCAACCTG GGCGATGCCA AGACCACCAT CACCCATCCG GCGACCACCA CCCACGGCCG CGTGGACCCG GACAAGCGCG AGGCACAGGG CATTACCGAG GCCATGGTGC GTGTCGCCGT AGGACTGGAG TCCGTCGCCG ATATCCAGCG CGACCTGGCC CGCGGGCTCG ACGCGCTCTG A
Protein sequence | MTDFDPSEYD PQTVAIRHGY QRTPEGEHSE AIFPTSSFVF GSAAEAAARF GGESPGNIYS RFTNPTVRTF QDRLAALEGG EACVATASGM SAILATMMGL LKAGDHVVCS RSVFGTTTVL FNNYLGRFGV EVTYVELSNL EAWEAATQPN TRLYFCETPS NPLGEVVDIR ALADIAHRHD VVLAVDNCFC TPALQRPLDL GADIIIHSAT KFLDGQGRCL GGAVVGDAER VGEEVYGFLR TAGPTLSPFN AWVFLKGLET LALRMRAHSD NALALAQWLQ AHPKVTAVHY PGLPDHPQHA IAKQQQSGFG GVLSFEVAGG REAAWSVIDA TQFLSITANL GDAKTTITHP ATTTHGRVDP DKREAQGITE AMVRVAVGLE SVADIQRDLA RGLDAL
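As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates DNA codon by codon and computes the GC fraction. It is a minimal example, not the pipeline that produced this record. It assumes the sequence is already given 5' to 3' in the coding orientation, contains only A, C, G and T, and starts with ATG. The amino acid assignments used here are those of the standard code, which translation table 11 shares; the alternative start codons of table 11 are not handled.

```python
# Amino acid assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter groups and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-letter groups of the gene sequence above.
first_groups = "ATGACCGACT TCGACCCCAG CGAATACGAC"
print(translate(first_groups))  # MTDFDPSEYD
```

The ten residues printed match the first group of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.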