Gene Francci3_4258 details


Gene Information

Locus tag: Francci3_4258
Symbol:
ID: 3907225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5079510
End bp: 5080967
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 73%
IMG OID: 637881584
Product: DNA repair protein RadA
Protein accession: YP_483333
Protein GI: 86742933
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1066] Predicted ATP-dependent serine protease
TIGRFAM ID: [TIGR00416] DNA repair protein RadA
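
The gene and protein lengths listed above are consistent with the start and end coordinates. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(5079510, 5080967)
print(length)                  # 1458
print(protein_length(length))  # 485
```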


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGACC AGCGGACTAA CGTTGCGTCC ATGGCATCCA CTCGCCCGTC GGCCGGCGGT 
CGCCCGTCGA GTCGTGCGGC CTCAGGCGGG TTCCGGTGTA ACAGCTGCGG CTCGGAGTCC
GTGCGGTGGG CGGGACGATG CTCGCGCTGC CAGGAGTGGG GCACGCTGGA GGCCCAGGCC
CCCGCCCCAC GCCGGGGGGC CGGCCTGGTC GCGGGGGGTC GTTCCGCCTC GTCCGCCGCA
GTGACGGCAC CGGCTCTGCC GGTCATGTCC GTTGACGTGA CCGCGACGCG GCGTCGTCCG
ACCGGGATCG ACGAACTCGA CCGCGTGCTG GGCGGCGGCC TCGTTCCCGG CGCCGTGATC
CTGCTCGCCG GGGAACCGGG GGTCGGCAAG TCCACCCTAT TACTGGAGGT CGCGGCCCGC
AGCGCCGCCG CCGGTGCCCG CGCCCTGGTC GTCACCGGGG AGGAATCCGC GGCGCAGGTC
AGGCTGCGGG CGGGTCGGAC GGGTGCGCTG CACGAGGACC TGTGGATCGC CGCCGAGACG
GACCTCGGCG CCGTCCTGCG TCACGTCGAG GAGGTCTCAC CGGCCCTGCT GGTGGTCGAT
TCGGTGCAGA CCATCTCCGC GGCCGGGGTG GACGGAGCCG CGGGCGGGGT CACCCAGGTC
CGTGAGGTCA CCGCGGCCCT GATCCGGACG GCCAAGGCGC TGGGACTCGT GACGGTGCTT
GTCGGCCATG TCACGAAGGA TGGCCTGGTC GCCGGACCAC GCCTCCTGGA ACATCTGGTC
GACGTGGTGC TGCACTTCGA GGGCGAGCGG CACTCCGCAC TGCGCCTGGT CCGCGCGGGC
AAGAATCGGT ACGGACCAGC CGACGAGGTC GGCTGTTTCG AGATGGACGA TTCCGGTATC
CATGGCATCG CCGACCCGAG CGGGCTGTTC CTGTCCCGGT CCAGCGGCGC GGGGCTGGAG
GCGGTGCCGG GAACCTGTGT CACCGTAACG GTTGAGGGCC GACGGCCACT CGTCGCCGAG
GTTCAGGCCC TGGTGGCGGA AACCTCGGCG CAGATCCCCC GGCGCGCCGT GTCCGGGCTC
GATCCGGCCC GGGTGGCGAT GATTCTCGCC GTGGTCGAGC GCCGGGCGAA GGTGCGCTTC
GGCCGGGCCG ACGTGTACGC CGCGACCGTC GGCGGGGTTC GGTTGGCCGA GCCCGCGGCG
GACCTCGCGA CGGCCTTGGC GATCGTCAGC GCGGCCAGGG ACCGTCCGTT GCCGGCCGAT
CTCGTCGCCA TCGGCGAGGT CGGCCTGGCC GGGGAGGTCC GCGCGGTGGG CTCGGTACGC
CAACGGCTCG CCGCCGCGGC CCGGCTCGGC TTTCGCCGGG CGCTCGTTCC GGCGGACCCC
GGCCCGGCTC CGGACGGCAT GAAAGTCACC GAGGTACCCG ACCTGATCGG CGCGATCAGC
AAAATGCATG AAACATAG
 
Protein sequence
MSDQRTNVAS MASTRPSAGG RPSSRAASGG FRCNSCGSES VRWAGRCSRC QEWGTLEAQA 
PAPRRGAGLV AGGRSASSAA VTAPALPVMS VDVTATRRRP TGIDELDRVL GGGLVPGAVI
LLAGEPGVGK STLLLEVAAR SAAAGARALV VTGEESAAQV RLRAGRTGAL HEDLWIAAET
DLGAVLRHVE EVSPALLVVD SVQTISAAGV DGAAGGVTQV REVTAALIRT AKALGLVTVL
VGHVTKDGLV AGPRLLEHLV DVVLHFEGER HSALRLVRAG KNRYGPADEV GCFEMDDSGI
HGIADPSGLF LSRSSGAGLE AVPGTCVTVT VEGRRPLVAE VQALVAETSA QIPRRAVSGL
DPARVAMILA VVERRAKVRF GRADVYAATV GGVRLAEPAA DLATALAIVS AARDRPLPAD
LVAIGEVGLA GEVRAVGSVR QRLAAAARLG FRRALVPADP GPAPDGMKVT EVPDLIGAIS
KMHET
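
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one possible way to clean the blocked text, compute GC content, and translate the gene to check it against the protein sequence. It assumes the standard codon assignments, which NCBI translation table 11 shares with table 1 (the tables differ in which codons may serve as starts). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS))


def clean(blocked: str) -> str:
    """Join space- and newline-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and the last line of the gene sequence shown above.
print(translate(clean("ATGTCGGACC AGCGGACTAA")))  # MSDQRT
print(translate(clean("AAAATGCATG AAACATAG")))    # KMHET
```

Passing the full gene sequence through `clean` and then `gc_content` or `translate` would allow a comparison with the GC content and protein sequence listed in this record.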