Gene Cag_0603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0603 
Symbol 
ID3746296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp706047 
End bp707753 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content49% 
IMG OID637773137 
ProductRecJ exonuclease 
Protein accessionYP_378919 
Protein GI78188581 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGAT ATCGTTGGAA GTGCTTCATG CCACATGAGG AAACAGTTGC GGCATTATCG 
GAGTCCATTA ATGTGTCGCA GCCTATTGCG CGTGCGTTGT GTAATCGTGG CATTTCCACT
TACAACGAGG CTAAAGAGTT TTTTCGTCCT GTGCTTTCCA CCCTTCACTC TCCATGGCTT
TTTAACGATA TGGAGCGTGC GGTAGAGCGT TTGGTTCGTG CTCTTAAAAA TGGTGAAACC
ATTTTGTTGT ATGGCGATTA CGATGTTGAT GGCACTACGG GCGTAGCGCT TTTGCTGCTC
TTTTTGCGCC ATCACGGCGT TGAGCCGTTG TGGCATATTA ATGATCGTTT TGCGGAAGGG
TACGGTTTAT CGCCCGAAGG AATTGATCGA GTTATTGCAA GCGGTACAAC CTTGCTGATA
ACGGTGGATT GCGGTATTAA AGATCATGCG GCTATTCGGC GTTGCGGGGA GCATGGCGTT
GAGGTGATTG TGTGCGATCA CCACGAAGCC GATGTTACGC CTGAAGCCTA TGCTATTTTA
AATCCTAAAG TGGTTGGTTC GGGCTATCCT TTTCGTGAAC TGTGCGGGTG TGCGGTGGCG
TTTAAGTTGG TGCAAGCGCT TGCTGAGCGT TTGGGTGATA GTGAGGCGGT GTGGCATCAA
TTTCTTGATT TAGTTGCCGT TGCAACGGCG GCGGATTTGG TGTCGCTCAC GGGCGAAAAT
CGCACGTTGG TGATTGAAGG GTTGCAGCAA ATGCGCTCTA AGCCACGCAA AAATTTTAGC
GAAATGTTTC GGGTTATGAA GGTTTCGCTT GGCGATGTTC GGATGTTTCA TCTTGCTTTT
GGCATTGCGC CGCGCATTAA TGCGGCTGGA AGAATGCACT CGGCGCATCT TGCGCTTGAG
TGGCTGCTGG CAAGTGCGCC CGATGCGGTG GAGCAGCACA CGGAGGCGCT TGAGCGGGTG
AATGTGCAGC GCCGTTCGCT TGATAGCACC ATTATGTCGC AAGCTGATAA GATGGTTGAA
AGCCATTGTG CCTCCTACTG CTCTTCCATT GTGCTCTACG ATGAGGCGTG GCATCTTGGG
GTGTTGGGCA TTGTGGCATC TAAACTTATT GATAAATATT ATTTGCCAAC GGTTGTGTTG
GGGGGAATGA ATGGCTTGGT TCGAGGTTCG GTACGAAGCA TTGAAGGGTT GAATATTCAT
GCGGTGCTTC AGCACTGTAG CCACCATCTT GAGCAATTTG GTGGACACCA TCAAGCGGCG
GGTTTAACGC TCAAGCCTGA AAATCTGGCG GTATTTCGCA AAGCTTTTGA CGAGCAGTGT
GCCAATCAGC TTACCATTGA GCAGCGCCAA AAGGTGATGG AAATTGATGC AGTGGTTGAG
CTTGAGCAGA TTACCGACAA ATTTATTGCG GTGTTGGAAC AATTTGCCCC TTACGGCATT
GGCAACCGTG AACCACTTTT TATGAGTGAA CGGCTGCAAC TTGCTGAGCC TGCTCGACTT
TTAAAAGAGC GCCATGTAAA GTTTGCTGTG CGCGATAAGC AAAAGCGCCG TTTTGAGGTG
ATAGGCTTTA ACCGCCCCGA TATTTATAAT GATTTACGAG CTGTAAAGCA TCCAACGATT
ACCATGCTTT ACACTATTGA GCGCCGCCAA TGGAATGGCA TGTGGCAAGT GCAACTCTTA
TTGAAGGATT TGGAGGTGCA GCGCTAA
 
Protein sequence
MKRYRWKCFM PHEETVAALS ESINVSQPIA RALCNRGIST YNEAKEFFRP VLSTLHSPWL 
FNDMERAVER LVRALKNGET ILLYGDYDVD GTTGVALLLL FLRHHGVEPL WHINDRFAEG
YGLSPEGIDR VIASGTTLLI TVDCGIKDHA AIRRCGEHGV EVIVCDHHEA DVTPEAYAIL
NPKVVGSGYP FRELCGCAVA FKLVQALAER LGDSEAVWHQ FLDLVAVATA ADLVSLTGEN
RTLVIEGLQQ MRSKPRKNFS EMFRVMKVSL GDVRMFHLAF GIAPRINAAG RMHSAHLALE
WLLASAPDAV EQHTEALERV NVQRRSLDST IMSQADKMVE SHCASYCSSI VLYDEAWHLG
VLGIVASKLI DKYYLPTVVL GGMNGLVRGS VRSIEGLNIH AVLQHCSHHL EQFGGHHQAA
GLTLKPENLA VFRKAFDEQC ANQLTIEQRQ KVMEIDAVVE LEQITDKFIA VLEQFAPYGI
GNREPLFMSE RLQLAEPARL LKERHVKFAV RDKQKRRFEV IGFNRPDIYN DLRAVKHPTI
TMLYTIERRQ WNGMWQVQLL LKDLEVQR