Gene Cag_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1020 
Symbol 
ID3746748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1368168 
End bp1369745 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content50% 
IMG OID637773549 
ProductYjeF-related protein-like 
Protein accessionYP_379325 
Protein GI78188987 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0522551 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACCCG TACTGACCGC ACAAGAAATG CAAGCTGCCG ACCGTGCAGC AATTGAAACG 
CTCCATATTA GCGAAGCACG GCTTATGGAG CTTGCGGGAC GTGAATGCTT ACGCCTTATT
TTGGATATGC TGGAACGAAA AAAGCTTGAT GGTTGCGGCT TTTTAATTCT TTGTGGTAAG
GGAAATAATG GGGGCGATGG CTTTGTGTTG GCTCGACATC TGCTCAATTA CGGAGCTGCG
GTTGATGTGG TGTTGCTTTA CCCCCCAAGC ATCTTGCAAG GCGTCAATCG TGAAGGCTTT
GCAACCTTGC AAGCCTATGA AGCCGAGCAA GCACCGCTTC GCATTTTTGA AGGTATTGAA
GAAGCACTCC CTTTTGTGGA GGAAAACCAC TACACCATGC TCATTGATGC CATGACGGGC
ACAGGCTTGC GCCTTGCTCG ACGTGGCATG GAGTTGGCGC CTCCGCTCTC CGATGGTATT
GAGTTGCTGA ACCGAATGCG CCACGAAAGC AACGCCACAA CGCTCGCCAT TGATATTCCA
TCGGGGCTTG AAGCCACTAC AGGTTTTGCT GCACAACCCG TTGTGGAAGC TGATGTAACC
GTAACCATGG CATTCCTGAA ACGCGGCTTT TTGTTGAACG ATGGTCCCGA ATGTGCTGGC
GATGTAAAAG TTGCTGAAAT TTCCATTCCC ACCTTTCTTA CCGAATCAGC AAGCTGCCGT
TTAATTGATC AAGAGTTTGC TGCCGAGCAT TTTTTGCTGC GTGAACCAAG TAGCGCAAAG
CAACACAATG GCAAAGTGCT CATGATTGTT GGCTCACAAA GCGCACAACA CTCCATGCTT
GGAGCGGCAA TCTTAGCCGC AAAAGCCGCC ATAAAAAGTG GCATTGGCTA CCTTTGCTGC
TCACTGCCAC AAGAGCTTGT CGGTGCCATG CACCTTGCGG TTCCTGAAGC CGTGCTTATT
GGGCGCGATG TAGATGTGCT CACGGAAAAA ATTGCATGGG CTGACTCTGT GCTTATTGGG
TGCGGTTTAG GGCGCAATGC TGAAGCGTTA GAGTTGGTGG AAATGCTGTT GCAAAGTGAA
ACCCTACAAA GTAAAAAGCT CATTCTTGAT GCCGATGCAC TGTTTGCACT CAGTACACTT
GATGCCATAA CGGCGTTACA AAAGTGCAAC CACGTACTGC TTACCCCTCA TTATGGCGAA
CTAAGCCGAT TATGCAACAT CCCTATAGCT GATATTGCAG CCAATCCCAT TGAAATTGCC
CATGAGTGCG CCTGTAATTT TAGCGCCACC ATGTTGCTTA AAGGCAATCC AACCGTTATT
GCCAATGGAA AATATCCCAT ATTGCTCAAC AATAGCGGCA CCGAAGCGTT AGCAACCGCA
GGTTCAGGTG ATGTGCTTGC TGGCTTGATA GCATCGCTTG CCGCCAAAGG TGCCACCCTT
CCCCATGCCG CCGCTGCTGC CACATGGTTC CATGGGCGGG CGGGCGACCT AGCACACGAC
GTTGCAAGTT TAGTAACGGC TACCATGGTT GCAGATGCCA TTGCACAAGC TATCGGTGAG
GTGTTTGAGG TGGAGTAA
 
Protein sequence
MQPVLTAQEM QAADRAAIET LHISEARLME LAGRECLRLI LDMLERKKLD GCGFLILCGK 
GNNGGDGFVL ARHLLNYGAA VDVVLLYPPS ILQGVNREGF ATLQAYEAEQ APLRIFEGIE
EALPFVEENH YTMLIDAMTG TGLRLARRGM ELAPPLSDGI ELLNRMRHES NATTLAIDIP
SGLEATTGFA AQPVVEADVT VTMAFLKRGF LLNDGPECAG DVKVAEISIP TFLTESASCR
LIDQEFAAEH FLLREPSSAK QHNGKVLMIV GSQSAQHSML GAAILAAKAA IKSGIGYLCC
SLPQELVGAM HLAVPEAVLI GRDVDVLTEK IAWADSVLIG CGLGRNAEAL ELVEMLLQSE
TLQSKKLILD ADALFALSTL DAITALQKCN HVLLTPHYGE LSRLCNIPIA DIAANPIEIA
HECACNFSAT MLLKGNPTVI ANGKYPILLN NSGTEALATA GSGDVLAGLI ASLAAKGATL
PHAAAAATWF HGRAGDLAHD VASLVTATMV ADAIAQAIGE VFEVE