Gene Tcr_1817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_1817 
Symbol 
ID3761389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp1991836 
End bp1993308 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content51% 
IMG OID637786561 
Producthypothetical protein 
Protein accessionYP_392083 
Protein GI78486158 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCAGC TGTACACCAC TCAAAGCACG CAAGCGATTG AACGTTTTGC CATTGATCAG 
CAAAGCATCC CTGGACTCCT TTTGATGAAA CGTGCCGCTT ATTTTAGCTA CCAGACCCTC
CGAAGATGCT ACCCCGATTC ACAAAATGTT TTAGTCGTCT GCGGCACGGG CAACAATGGC
GGCGATGGGT TGGCTCTGGC GCAGTATGCA CTGATTGACG GCTGCAACGT TTCCATTGCG
TTACTGGGAT CACAAGATAA AATTAAAGGC GACGCCCAAA CCTGTTTGCA AGAATGCCTG
GCACTAGGTC TCTCGCCGCA ACCGTTTGAT TCCACGTTGC TTGAAAATGT CGATACGATT
GTGGACGCGG TGTTTGGCAC AGGTTTGAAC CAACCCGTGA CCGGAGAGTA CGCTGAGATT
TTTGAACGCT TGAACGAAAC GCATACGCCC ATTCTGGCAC TCGACATTCC CAGCGGCTTA
CAGGCGGATA CCGGCAACAT CTTAGGCACG GCTATCCGCG CCAGCCACAC TTGCACGTTC
ATCACCCATA AGCCGGGGCT CTATACTTAC CTTGGCCCCG AAACGGCAGG CAAAATTCAT
TTTAGTCCAT TATTTTTAAG CCAGAACAAC TACGCCGAGC AATCCCCTAT TGCTGAAAGT
CACTCTCTCA AATACTGGCT GAACCAACTG CCTAAAACCC CCGCATCAAG TCATAAAGGC
ACACGAGGCA CACTCTTACT GATCGGCGGA AACCATCATA TGATGGGCGC CATTCAATTA
GCCAGCCTGG CCGCTTTAAC CACCGGGGCA GGCCTGGTCA AAATCATCAC ACAACCTGAT
CATTTAACGG CATTAACCCA GGCACAGCCT GAGCTTATGA CATACACCGA ACACGAATTT
GAACAGCAAG CGGCCACGGC TAACGTCATT GGTATCGGCC CCGGGCTGGA TCAGGATGAT
TGGGCAATCG ATCGTTTCCA CGACGCACTT AACCACAGCA GTCCTAAAGT TTTAGACGCC
GATGCGTTAA ATCTGTTAGC ACAATCCCCG CAACAACAAA ACCATTGGGT TCTCACCCCT
CACCCGGGAG AAGCGGCCAG ATTGCTGGGA ACATCAACGG AAACCATTCA AAGCAATCGA
ATCGAAGCCA TCAAAAGGCT GCAGCAAAAA TACGGTGGGG TCATTGTGTT AAAAGGCAAT
GGCACTCTGG TTTACGATGG CAAGCAAATG GAATTGTGTA CCGCAGGCAA TGCCGGCATG
GCGGTGGGCG GAATGGGAGA TGTTCTAACC GGTGCGATCA CCAGCTTTAT CGCGCAAGGC
ATGGCGTTAT ACCCCGCAGC GTGTTTAGCC GTTTCTTTAC ATGCACACAG CGGCGACACT
CTGGCCAATC AAAAAAGCCA AGCCGGGGTC ATTCCCTCCG ACTTGGCTTT AGTGATGAGC
CAGTTGTTAA GCTATGCCAG CAAAAACTCC TGA
 
Protein sequence
MMQLYTTQST QAIERFAIDQ QSIPGLLLMK RAAYFSYQTL RRCYPDSQNV LVVCGTGNNG 
GDGLALAQYA LIDGCNVSIA LLGSQDKIKG DAQTCLQECL ALGLSPQPFD STLLENVDTI
VDAVFGTGLN QPVTGEYAEI FERLNETHTP ILALDIPSGL QADTGNILGT AIRASHTCTF
ITHKPGLYTY LGPETAGKIH FSPLFLSQNN YAEQSPIAES HSLKYWLNQL PKTPASSHKG
TRGTLLLIGG NHHMMGAIQL ASLAALTTGA GLVKIITQPD HLTALTQAQP ELMTYTEHEF
EQQAATANVI GIGPGLDQDD WAIDRFHDAL NHSSPKVLDA DALNLLAQSP QQQNHWVLTP
HPGEAARLLG TSTETIQSNR IEAIKRLQQK YGGVIVLKGN GTLVYDGKQM ELCTAGNAGM
AVGGMGDVLT GAITSFIAQG MALYPAACLA VSLHAHSGDT LANQKSQAGV IPSDLALVMS
QLLSYASKNS