Gene Cag_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0003 
Symbol 
ID3747796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp3251 
End bp4345 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content39% 
IMG OID637772526 
ProductRecF protein 
Protein accessionYP_378325 
Protein GI78187987 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.056932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAATTGC AACGAACCAT TTTTTCTGGA TTTCGGAATC ATACCTCGTT GCTTTTTGAA 
CCATCTGAGG GTGTAACCAT TATTTATGGA GCAAATGGTT CAGGTAAAAC ATCACTGCTT
GAAGGCATTC ATTACGGCGC ACTAACAAAA GGACTCCTTG GTGCGCCTGA TAGTGAATGC
CTTTCGTTTG ATACTGAGGC TTTTACCCTT GATTCTCATT TTTTATCGGA TAGTAATATT
CCGATTCATG TACTTGTAAC GTATCAGCTT GAAGGTGAAA AGCAAGTTAT TGTGGATCGT
CAAGAGGTAA AACCCTTTTC ATCACATATA GGACGTATTC CCACTATCAC TTTTTCACCG
TATGAAATAT CTTTAGTAAG TGGTCCTCCT GCTGAACGTC GTCGCTTTTT AGATAGTGCT
ATTAGCCAAT TAGATCATCG TTATTTAGAT CGTCTGATTA CTTATCGTCG TATTTTACAG
CAGCGAAATG CGTTACTTGC GCAACTATCC TCTGGTGAAA AAAGTAATCG TAACACCTTA
CCTTTATGGA CAACACAACT TGCTGAATTA AGCGCATGGC TTGTAGAACG CCGCTTACTC
TTTCTTACCT CATTTTCTCC CTACTTCCAA CACTATTATC GTTACATTAT TAAGGGTGAA
GAGCCATCAA TAAATTACCG TTGTACCTCT TGCCCTCTCC ATGGTAATAC TACCTTTCAA
GAGCTGTATC AGCTTTTTCT ACAACGATAT TCTGATATTG AAGCACAAGA AATTCAACGA
GGGCAAACAC TTTTTGGAGC ACATCGTGAT GATGTTCTCT TTTTTTTAAA TGAAAAAGAG
ATTAAGCGTT ATGCTTCACA AGGGCAGTTA CGAAGCTTTT TAATCGCGTT AAAAATCAGC
CAAGCACATC TTTTTGCTGA TCACTTACAT GAACAACCGA TGTGCTTGTT TGATGATTTA
TTTAGCGAGT TAGATGGAGG GCGTATTGAG CAAATTCTTG CTTTATTAAA AGAGTGTGGA
CAAACAATTA TTACAGCGGT TGAACCACGT TATACGGAAG GAATTACACT CTGTGATATT
CAAGCGTTGA GGTAA
 
Protein sequence
MKLQRTIFSG FRNHTSLLFE PSEGVTIIYG ANGSGKTSLL EGIHYGALTK GLLGAPDSEC 
LSFDTEAFTL DSHFLSDSNI PIHVLVTYQL EGEKQVIVDR QEVKPFSSHI GRIPTITFSP
YEISLVSGPP AERRRFLDSA ISQLDHRYLD RLITYRRILQ QRNALLAQLS SGEKSNRNTL
PLWTTQLAEL SAWLVERRLL FLTSFSPYFQ HYYRYIIKGE EPSINYRCTS CPLHGNTTFQ
ELYQLFLQRY SDIEAQEIQR GQTLFGAHRD DVLFFLNEKE IKRYASQGQL RSFLIALKIS
QAHLFADHLH EQPMCLFDDL FSELDGGRIE QILALLKECG QTIITAVEPR YTEGITLCDI
QALR