Gene Cagg_0434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0434 
Symbol 
ID7266602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp538220 
End bp539398 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content58% 
IMG OID643565301 
ProductDNA replication and repair protein RecF 
Protein accessionYP_002461815 
Protein GI219847382 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.626531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACATTC ATCACCTTGC TTTGCGCGAT TTTCGTAACT ACCGGCGGCA AGATGTGGCC 
CTCTCGCCGA CAACGATCCT CTTGTACGGC CCGAATGCTG CCGGTAAGAC GAGCCTCCTT
GAGGCTATTT TTTATCTCGC CACTACGCGC TCGCCTCGCC TCAGCAGCGA TCGCGACCTC
GTGCGCTGGG ATGCAGTCGG TGAAGCCGGC GCGCCACCTT TTGCCCGCAT TGCAGCCGAT
GTTGAGCGTC GGATCGGACC GGTACGGCTT GAGATACTGG TACAGCGCCG GCTCGATGAT
GGTGGTCAGC CGCTAAACGG CGCGCAAAAA TTGGTGCGGA TCGATAAGCG CCCGGCGCGC
GCGATTGATC TGATCGGTCA GTTGCGGGTA GTGCTCTTTA CCCCTACCGA TGTCATGCTG
GTTGATGGCC CCCCTGCCGA ACGGCGGCGC TACCTCGACA TTACCCTCTC ACAGCTCGAT
CCGCACTACG TGCGAACCCT GGCGTATTAC CAGAAGATTC TGTTGCAACG CAATAGTCTC
TTGCGCGCAT GGCGTGAACA ACGTCGCTTG CCCCGCAATG TTGATGCCGA GTTAGGGTAT
TGGGATCAAG AGCTGGCCGC TGCCGGAGGA TATTTGTTGG CCGAACGGTT GCGGGCCGTT
GTCGAATTGA GCGCGCTGGC CGGGTCCATT TACCGAAAGA TTAGCGGTGG CGAACACGAA
TTGCAGATTG AATATATCGC CAGTTGTGAC CTCGACGCAG CGCGCGATGC CGGTAGTCTA
GCCGAGCGAC TACGTTTAGC GTTTGCCGCT CAGCGTACCG ATGAGCTGGC CCGTGGGCAG
ACACTCTGCG GACCACACCG TGATGATCTG GTTTTTAACG TTGCCGGTGT GAATCTTGGT
CGGTATGGTT CGCGTGGTCA ACAACGTACC ATTGCCCTTG CGCTCAAGAT CGGTGAAGCC
GAACTGATGC AGCAACGCGG TGGTGATGCA CCGGTGCTCT TACTCGATGA TGTTTTGAGC
GAACTCGACA ACCGACGTCG CATGCATTTG CTTGACCTTA TTCTGCGCCC GCAACAGCAG
ACACTGCTCA CGGCCACCAA TCTGAGCGAC TTTAGTGCTG ATTTTCTCGC TGCCGCCCGT
CGCTTTCGGG TCGAAGATGG TCAGCTTTTT GCCGGTTGA
 
Protein sequence
MYIHHLALRD FRNYRRQDVA LSPTTILLYG PNAAGKTSLL EAIFYLATTR SPRLSSDRDL 
VRWDAVGEAG APPFARIAAD VERRIGPVRL EILVQRRLDD GGQPLNGAQK LVRIDKRPAR
AIDLIGQLRV VLFTPTDVML VDGPPAERRR YLDITLSQLD PHYVRTLAYY QKILLQRNSL
LRAWREQRRL PRNVDAELGY WDQELAAAGG YLLAERLRAV VELSALAGSI YRKISGGEHE
LQIEYIASCD LDAARDAGSL AERLRLAFAA QRTDELARGQ TLCGPHRDDL VFNVAGVNLG
RYGSRGQQRT IALALKIGEA ELMQQRGGDA PVLLLDDVLS ELDNRRRMHL LDLILRPQQQ
TLLTATNLSD FSADFLAAAR RFRVEDGQLF AG