Gene Cag_0004 details

Gene Information

Locus tag: Cag_0004
Symbol:
ID: 3747797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 4354
End bp: 5295
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 45%
IMG OID: 637772527
Product: hypothetical protein
Protein accession: YP_378326
Protein GI: 78187988
COG category: [R] General function prediction only
COG ID: [COG1090] Predicted nucleoside-diphosphate sugar epimerase
TIGRFAM ID: [TIGR01777] conserved hypothetical protein TIGR01777
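
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4354, 5295)
print(length_bp)                  # 942
print(protein_length(length_bp))  # 313
```

Both values agree with the Gene Length and Protein Length fields of this record.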


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.128205
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATC ACATTGTAAT AACAGGCGCA ACGGGCGTTA TTGGTGTTGA ACTTGCTCAA 
AAGCTCATAA AGCGCGGAGA AAAAGTGGTG CTGCTTGCAC GTTCACCAAA TGCCGCACAA
CAAAAAATTC CGGGGGCAGC CGCTTATGTC CGTTGGGATT CTGATATGCA AGAGGGAGAA
TGGAAAAGCA CTATCAGTGG AGCCAAAGCC GTTATTCATT TAGCAGGAAA ACCACTCCTT
GAAAGTCGCT GGAACGAAGA GCATAAGCAA GAATGCTACC AATCGCGCAT TATAGGGACA
CGTCATATTG TGGCGGCTAT TGCTGAAGCT GCTGAAAAAC CACAAGTTTT TATCTCCTCT
TCAGCAATTG GCTATTACGG CTCCTTCGAT AAATGTAGCG ACACGGCTCC TCTTACCGAA
TCAGGCAACA AAGGCAGCGA CTTTTTAGCC CACATTTGTA TTGATTGGGA AGAGGAGGCT
CGTAAAGCTG AAAACCTTGT GCCTCGCTTA GTGTTTTTGC GGACTGGCAT TGTGCTCTCT
ACACGCGGCG GCATGTTGCA AAAAATGATG ACTCCATTCC AATATTTTGC AGGTGGTCCA
ATTGGAACAG GGTTACAGTG CATCTCATGG ATTCACATGG ATGACGAAGT CAACGCTATT
ATTGCATCGC TTGATAATTC TGCTTACAAA GGAGCAATTA ATCTTGTAGC TCCAACGCCC
GTTTCAATGA AAGAATTTGC AAGCAAACTT GGAGCTGTTA TGGGGCGACC TTCGCTTTTG
CAAGTACCTG AATTTGCAGT CAAAATGCTT ATGGGCGAAG GGGGAGAATA TGCTGTTCGA
GGGCAAAAAG TGCTTCCTAC CTTTCTTGAA AAACAAGGTT TTACATTCCG TTACCCTGAC
CTTTCAAACG CACTTGGTGA TTTAATTAAG CACGGAAAGT AG
 
Protein sequence
MNNHIVITGA TGVIGVELAQ KLIKRGEKVV LLARSPNAAQ QKIPGAAAYV RWDSDMQEGE 
WKSTISGAKA VIHLAGKPLL ESRWNEEHKQ ECYQSRIIGT RHIVAAIAEA AEKPQVFISS
SAIGYYGSFD KCSDTAPLTE SGNKGSDFLA HICIDWEEEA RKAENLVPRL VFLRTGIVLS
TRGGMLQKMM TPFQYFAGGP IGTGLQCISW IHMDDEVNAI IASLDNSAYK GAINLVAPTP
VSMKEFASKL GAVMGRPSLL QVPEFAVKML MGEGGEYAVR GQKVLPTFLE KQGFTFRYPD
LSNALGDLIK HGK
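
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of that step plus a simple GC percentage. The helper names `join_blocks` and `gc_percent` are hypothetical, and how the database rounds its GC content field is not stated here.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to group a sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = join_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the protein sequence: one full block plus three residues.
print(len(join_blocks("LSNALGDLIK HGK")))   # 13

# First two blocks of the gene sequence (4 of 20 bases are G or C).
print(gc_percent("ATGAATAATC ACATTGTAAT"))  # 20.0
```

Applying `gc_percent` to the full gene sequence gives a value that can be compared with the GC content field of this record.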