Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0619 |
Symbol | |
ID | 8806368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 658644 |
End bp | 659774 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | polysaccharide export protein |
Protein accession | YP_003459870 |
Protein GI | 289207804 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.112684 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.159669 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTGA TCCCCCCCGT ACGCTGGGTC CTGTTGCTGA GCCTGCTGGT GATATTGGGT GGGTGTGCCT GGGCCCCTGG CGGGCACATT GCCGAACGCA CCTCCAGTGC CCCGGTCGAG GACCTGGTCG ATATCGAGCC CATAACCTTT GGCCTGATCC GGGCCCAGGA GACCCGCTCG GTACCGCCAG ACTTGCGCCG TGTAAGCGAC GAGATGGCAC AGGACGTCGA TGAATACGAT TACCGCATCG GTCGCGGCGA CGTGCTGGCG GTCATCGTGT ACGAACACCC CGAGCTGACC ATTCCGGCCG GCAGCGAGCG GTCCGCTGTG GAATCGGGGA ATACCGTGCA CCCCGACGGG ACCATCTTTT ATCCCTATAT CGGGCGCGTC GAAGTCGAGG GGTACACCGT GTCGCAGGTG CGTGACCGGA TCGCGCGCGA TCTCGCGACC TTCGTGAACG AGCCCCAGGT CGAGGTGCGC GTGGCGGCCT TCAACTCGCA GAAGGTCCAG GTGACAGGCT CGGTACGCGA ACCCGGCGTG CAGCCAGTGA CCAATGTGCC GCTGACCGTG CTCGATGCGA TCCACCACGC GGGCGGGCTG TCCGACGATG CCAACTGGCA CGAGGTGATC CTGACACGGG AAGGCGAAGA GGTCGTGATC TCGCTCTACG ACATGTTGCG CTACGGCGAC ATGTCGCAGA ACCGGCTCCT GCGCGATGGC GACGTGCTGC ACGTCCCGGA TACTGCCGGC CAGCAGATCT ACGTGATGGG CGAAGTGGGC GACCCGCAAC GGCTGCCCCT GGGCCGCGGT CAGCTCACGC TGATGGATGC GCTCAGCCAG GCCGGAGGGT TTAACGAGAC TCGGGCGGAC GCCAGCGGCA TCTTCGTGAT CCGCCGTGCA CCACCGGACA GCGACAAGCT CGCGACCGTC TACCAGCTGG ATGCGCGCAA TTCCGCAGCG CTGATGCTGG GGGCCGAGTT CGAGCTGGAG CCGCTGGACA TCGTGTATGT GACGACGACG TCGCTGGGCC GCTGGAACCG CGTGATCAGT CAGCTGCTGC CCACGGTCAG CGCGGTTTAC CAATCCTCCC GTACCACGCG CGACGTGCGC CGCCTGTCGG ATGACTTCTA G
|
Protein sequence | MNVIPPVRWV LLLSLLVILG GCAWAPGGHI AERTSSAPVE DLVDIEPITF GLIRAQETRS VPPDLRRVSD EMAQDVDEYD YRIGRGDVLA VIVYEHPELT IPAGSERSAV ESGNTVHPDG TIFYPYIGRV EVEGYTVSQV RDRIARDLAT FVNEPQVEVR VAAFNSQKVQ VTGSVREPGV QPVTNVPLTV LDAIHHAGGL SDDANWHEVI LTREGEEVVI SLYDMLRYGD MSQNRLLRDG DVLHVPDTAG QQIYVMGEVG DPQRLPLGRG QLTLMDALSQ AGGFNETRAD ASGIFVIRRA PPDSDKLATV YQLDARNSAA LMLGAEFELE PLDIVYVTTT SLGRWNRVIS QLLPTVSAVY QSSRTTRDVR RLSDDF
|
| |