Gene Francci3_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0003 
Symbol 
ID3902949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2488 
End bp3696 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content70% 
IMG OID637877332 
ProductDNA polymerase III, beta subunit 
Protein accessionYP_479126 
Protein GI86738726 
COG category[L] Replication, recombination and repair 
COG ID[COG0592] DNA polymerase sliding clamp subunit (PCNA homolog) 
TIGRFAM ID[TIGR00663] DNA polymerase III, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000138278 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCC GGGTGGAACG GGACGAATTC ACCGAAGCGG TCGCCTGGAC CGCACGCACG 
TTGCCGAGCC GGCCGACCAC CCAGTTGCAG GTGCTCTCGG GCCTGCTCCT CGACGCGACC
GGACCGATAC TCAAGATCGC CGCGTACGAC TACGAGGTGG CCGCCCAGTG CACGGTGCAC
GCGACCGTGT CGGAGGAGGG GCGCGGGCTC GTCAACGGCA AACTGCTCGC GGAGATCACC
CGGGCGCTGC CCGCGGCACC GGTCGATCTG GGCATCGACG GCACCCGCCT GGTGATCACT
TGTGGCAATG CTCGGTTCGC GTTGCCGATG CTGCCGGTGG ACGACTATCC CGCCCTTCCG
GCCATGCCCC CGATCACGGG GCACATCGAG GGATCCGCGT TCGCCGCGGC GGTCTCACAG
GTGGCGATCG CGGCCGGTCG TGACGACACG CTGCCGGTCC TCACCGGGGT CCGTATCGAG
ATCGAGGGGG ACACTCTCAC CCTCGCCGCG ACCGACCGTT ACCGGCTCGC CGTGCGGACG
TTGAAGTGGC GGCCGTCGGA GACAGCCGCC GGGCCCGATG AGGACGGCGT GACCGGTGTG
GACGGCCCCC CGCCCACACC GGTCACTGTC GCTCTCGTGC CGGCCCGCAC CCTGCTCGAC
ACCGCGAAGT CGTTGTCGGG TTCGGGGGTG GAGGTATCCA TCGCGCTCGG GACCGGCCCC
TCCGGCGAGA CGCTGGCCGG TTTCGCCGGC TCGACCCGGC AGACGACGAC CCGGTTGCTC
GACGGAAGCT TCCCGCCCTA CCGCAAGCTG TTGCCTGACA GTTCGCCGTT GATCGCGCAG
CTGGAGATCG CCCCGTTGCA GGAGGCCGTC AAACGGGTGG CGCTGGTCGC CGCGAAGACC
GCACCGGTGC AGCTGACGTT CTCCCCGGAC CACCTCGTGC TGGAGGCCGG GACGGGCGGT
GAGGCGCAGG CGACGGAGAC CCTCCCGGTG ACCTACGACG GACCGGAGCT GTCCGTGGCG
TTCAACCCGT CGTACCTGCT CGACGCACTC GGGGCGCTGG AGTCGGACGT CGTACGGATC
GGTTTCGCCA GCGCGGAGGA CCCGGCGGTG GCGGCGAACA AGCCGGCGAT CCTGACCGGG
AAGGCCGACG ACGACGGCGA GGTCCCCGAC TACCGGTACC TGCTGATGCC GATCCGCCTG
CACGGGTGA
 
Protein sequence
MKFRVERDEF TEAVAWTART LPSRPTTQLQ VLSGLLLDAT GPILKIAAYD YEVAAQCTVH 
ATVSEEGRGL VNGKLLAEIT RALPAAPVDL GIDGTRLVIT CGNARFALPM LPVDDYPALP
AMPPITGHIE GSAFAAAVSQ VAIAAGRDDT LPVLTGVRIE IEGDTLTLAA TDRYRLAVRT
LKWRPSETAA GPDEDGVTGV DGPPPTPVTV ALVPARTLLD TAKSLSGSGV EVSIALGTGP
SGETLAGFAG STRQTTTRLL DGSFPPYRKL LPDSSPLIAQ LEIAPLQEAV KRVALVAAKT
APVQLTFSPD HLVLEAGTGG EAQATETLPV TYDGPELSVA FNPSYLLDAL GALESDVVRI
GFASAEDPAV AANKPAILTG KADDDGEVPD YRYLLMPIRL HG