Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcr_1817 |
Symbol | |
ID | 3761389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | + |
Start bp | 1991836 |
End bp | 1993308 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637786561 |
Product | hypothetical protein |
Protein accession | YP_392083 |
Protein GI | 78486158 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCAGC TGTACACCAC TCAAAGCACG CAAGCGATTG AACGTTTTGC CATTGATCAG CAAAGCATCC CTGGACTCCT TTTGATGAAA CGTGCCGCTT ATTTTAGCTA CCAGACCCTC CGAAGATGCT ACCCCGATTC ACAAAATGTT TTAGTCGTCT GCGGCACGGG CAACAATGGC GGCGATGGGT TGGCTCTGGC GCAGTATGCA CTGATTGACG GCTGCAACGT TTCCATTGCG TTACTGGGAT CACAAGATAA AATTAAAGGC GACGCCCAAA CCTGTTTGCA AGAATGCCTG GCACTAGGTC TCTCGCCGCA ACCGTTTGAT TCCACGTTGC TTGAAAATGT CGATACGATT GTGGACGCGG TGTTTGGCAC AGGTTTGAAC CAACCCGTGA CCGGAGAGTA CGCTGAGATT TTTGAACGCT TGAACGAAAC GCATACGCCC ATTCTGGCAC TCGACATTCC CAGCGGCTTA CAGGCGGATA CCGGCAACAT CTTAGGCACG GCTATCCGCG CCAGCCACAC TTGCACGTTC ATCACCCATA AGCCGGGGCT CTATACTTAC CTTGGCCCCG AAACGGCAGG CAAAATTCAT TTTAGTCCAT TATTTTTAAG CCAGAACAAC TACGCCGAGC AATCCCCTAT TGCTGAAAGT CACTCTCTCA AATACTGGCT GAACCAACTG CCTAAAACCC CCGCATCAAG TCATAAAGGC ACACGAGGCA CACTCTTACT GATCGGCGGA AACCATCATA TGATGGGCGC CATTCAATTA GCCAGCCTGG CCGCTTTAAC CACCGGGGCA GGCCTGGTCA AAATCATCAC ACAACCTGAT CATTTAACGG CATTAACCCA GGCACAGCCT GAGCTTATGA CATACACCGA ACACGAATTT GAACAGCAAG CGGCCACGGC TAACGTCATT GGTATCGGCC CCGGGCTGGA TCAGGATGAT TGGGCAATCG ATCGTTTCCA CGACGCACTT AACCACAGCA GTCCTAAAGT TTTAGACGCC GATGCGTTAA ATCTGTTAGC ACAATCCCCG CAACAACAAA ACCATTGGGT TCTCACCCCT CACCCGGGAG AAGCGGCCAG ATTGCTGGGA ACATCAACGG AAACCATTCA AAGCAATCGA ATCGAAGCCA TCAAAAGGCT GCAGCAAAAA TACGGTGGGG TCATTGTGTT AAAAGGCAAT GGCACTCTGG TTTACGATGG CAAGCAAATG GAATTGTGTA CCGCAGGCAA TGCCGGCATG GCGGTGGGCG GAATGGGAGA TGTTCTAACC GGTGCGATCA CCAGCTTTAT CGCGCAAGGC ATGGCGTTAT ACCCCGCAGC GTGTTTAGCC GTTTCTTTAC ATGCACACAG CGGCGACACT CTGGCCAATC AAAAAAGCCA AGCCGGGGTC ATTCCCTCCG ACTTGGCTTT AGTGATGAGC CAGTTGTTAA GCTATGCCAG CAAAAACTCC TGA
|
Protein sequence | MMQLYTTQST QAIERFAIDQ QSIPGLLLMK RAAYFSYQTL RRCYPDSQNV LVVCGTGNNG GDGLALAQYA LIDGCNVSIA LLGSQDKIKG DAQTCLQECL ALGLSPQPFD STLLENVDTI VDAVFGTGLN QPVTGEYAEI FERLNETHTP ILALDIPSGL QADTGNILGT AIRASHTCTF ITHKPGLYTY LGPETAGKIH FSPLFLSQNN YAEQSPIAES HSLKYWLNQL PKTPASSHKG TRGTLLLIGG NHHMMGAIQL ASLAALTTGA GLVKIITQPD HLTALTQAQP ELMTYTEHEF EQQAATANVI GIGPGLDQDD WAIDRFHDAL NHSSPKVLDA DALNLLAQSP QQQNHWVLTP HPGEAARLLG TSTETIQSNR IEAIKRLQQK YGGVIVLKGN GTLVYDGKQM ELCTAGNAGM AVGGMGDVLT GAITSFIAQG MALYPAACLA VSLHAHSGDT LANQKSQAGV IPSDLALVMS QLLSYASKNS
|
| |