Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1020 |
Symbol | |
ID | 3746748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1368168 |
End bp | 1369745 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637773549 |
Product | YjeF-related protein-like |
Protein accession | YP_379325 |
Protein GI | 78188987 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0522551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACCCG TACTGACCGC ACAAGAAATG CAAGCTGCCG ACCGTGCAGC AATTGAAACG CTCCATATTA GCGAAGCACG GCTTATGGAG CTTGCGGGAC GTGAATGCTT ACGCCTTATT TTGGATATGC TGGAACGAAA AAAGCTTGAT GGTTGCGGCT TTTTAATTCT TTGTGGTAAG GGAAATAATG GGGGCGATGG CTTTGTGTTG GCTCGACATC TGCTCAATTA CGGAGCTGCG GTTGATGTGG TGTTGCTTTA CCCCCCAAGC ATCTTGCAAG GCGTCAATCG TGAAGGCTTT GCAACCTTGC AAGCCTATGA AGCCGAGCAA GCACCGCTTC GCATTTTTGA AGGTATTGAA GAAGCACTCC CTTTTGTGGA GGAAAACCAC TACACCATGC TCATTGATGC CATGACGGGC ACAGGCTTGC GCCTTGCTCG ACGTGGCATG GAGTTGGCGC CTCCGCTCTC CGATGGTATT GAGTTGCTGA ACCGAATGCG CCACGAAAGC AACGCCACAA CGCTCGCCAT TGATATTCCA TCGGGGCTTG AAGCCACTAC AGGTTTTGCT GCACAACCCG TTGTGGAAGC TGATGTAACC GTAACCATGG CATTCCTGAA ACGCGGCTTT TTGTTGAACG ATGGTCCCGA ATGTGCTGGC GATGTAAAAG TTGCTGAAAT TTCCATTCCC ACCTTTCTTA CCGAATCAGC AAGCTGCCGT TTAATTGATC AAGAGTTTGC TGCCGAGCAT TTTTTGCTGC GTGAACCAAG TAGCGCAAAG CAACACAATG GCAAAGTGCT CATGATTGTT GGCTCACAAA GCGCACAACA CTCCATGCTT GGAGCGGCAA TCTTAGCCGC AAAAGCCGCC ATAAAAAGTG GCATTGGCTA CCTTTGCTGC TCACTGCCAC AAGAGCTTGT CGGTGCCATG CACCTTGCGG TTCCTGAAGC CGTGCTTATT GGGCGCGATG TAGATGTGCT CACGGAAAAA ATTGCATGGG CTGACTCTGT GCTTATTGGG TGCGGTTTAG GGCGCAATGC TGAAGCGTTA GAGTTGGTGG AAATGCTGTT GCAAAGTGAA ACCCTACAAA GTAAAAAGCT CATTCTTGAT GCCGATGCAC TGTTTGCACT CAGTACACTT GATGCCATAA CGGCGTTACA AAAGTGCAAC CACGTACTGC TTACCCCTCA TTATGGCGAA CTAAGCCGAT TATGCAACAT CCCTATAGCT GATATTGCAG CCAATCCCAT TGAAATTGCC CATGAGTGCG CCTGTAATTT TAGCGCCACC ATGTTGCTTA AAGGCAATCC AACCGTTATT GCCAATGGAA AATATCCCAT ATTGCTCAAC AATAGCGGCA CCGAAGCGTT AGCAACCGCA GGTTCAGGTG ATGTGCTTGC TGGCTTGATA GCATCGCTTG CCGCCAAAGG TGCCACCCTT CCCCATGCCG CCGCTGCTGC CACATGGTTC CATGGGCGGG CGGGCGACCT AGCACACGAC GTTGCAAGTT TAGTAACGGC TACCATGGTT GCAGATGCCA TTGCACAAGC TATCGGTGAG GTGTTTGAGG TGGAGTAA
|
Protein sequence | MQPVLTAQEM QAADRAAIET LHISEARLME LAGRECLRLI LDMLERKKLD GCGFLILCGK GNNGGDGFVL ARHLLNYGAA VDVVLLYPPS ILQGVNREGF ATLQAYEAEQ APLRIFEGIE EALPFVEENH YTMLIDAMTG TGLRLARRGM ELAPPLSDGI ELLNRMRHES NATTLAIDIP SGLEATTGFA AQPVVEADVT VTMAFLKRGF LLNDGPECAG DVKVAEISIP TFLTESASCR LIDQEFAAEH FLLREPSSAK QHNGKVLMIV GSQSAQHSML GAAILAAKAA IKSGIGYLCC SLPQELVGAM HLAVPEAVLI GRDVDVLTEK IAWADSVLIG CGLGRNAEAL ELVEMLLQSE TLQSKKLILD ADALFALSTL DAITALQKCN HVLLTPHYGE LSRLCNIPIA DIAANPIEIA HECACNFSAT MLLKGNPTVI ANGKYPILLN NSGTEALATA GSGDVLAGLI ASLAAKGATL PHAAAAATWF HGRAGDLAHD VASLVTATMV ADAIAQAIGE VFEVE
|
| |