Gene Francci3_4168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4168 
SymboldnaE 
ID3907133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4969181 
End bp4972930 
Gene Length3750 bp 
Protein Length1249 aa 
Translation table11 
GC content66% 
IMG OID637881496 
ProductDNA polymerase III subunit alpha 
Protein accessionYP_483245 
Protein GI86742845 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACGG ATTCTTTTGT TCACCTCCAT GTGCACACTG AGTATTCGAT GCTGGACGGT 
GCGGCGAAAA CGGGCCTGCT TTTCAAGGAA GCGGCGAAGT TGGGGATGCC GGCTGTCGGG
ATGACCGACC ACGGTAACAT GTTCGGTGCC TACGAATTTT ATCAGGGTGC GAAGTCCGCG
GGTGTCAAGC CGATCATCGG GATCGAGGCG TACCTGGCGC CCGAGTCGCG CCACCACAAG
CGGCCTGTCC TGTGGGGGGA ACGCTCGCAG CGCGACGTCG ACCCTGCGGG TGAGGGCGGT
GACGTGTCCG GTGGCGGCGC CTACACCCAC ATGACGATGC TCGCAGCCAA CGCCGCCGGG
CTGCGCAACC TGTTCCGGCT GTCGTCGATC GCGTCGATCG AGGGCTACTA CCGCAAGCCG
CGGATGGACC ACGAGCTGGT CTCCCAGTAC TCCGAAGGGA TCATCGCGAC CACGGGCTGT
CCGTCAGGTG AGGTGCAGAC CCGGCTTCGT CTGGGCCAGT TCGACAAGGC GCTCGCGGCA
GCGGCCACCT ACCAGGAGGT CTTCGGCGCG GACAACTTCT TCCTGGAGTT GATGGATCAC
GGTTTGCCGA TCGAGCGCAG CGTCCGCCAG GGGCTGCTCG ACATCGGCGA CAAGCTGGGG
CTTCGCCCGC TGGCCACGAA CGATTCGCAC TATGTGACCC AGGATCAGGC GGGAAGCCAC
GAGGTGCTGC TGTGCGTCGG CACCGGTAAA AAACTCGACG ATCCGACCCG TTTCAAATTC
GACGGATCGG GATATTATCT CAAGTCTTCG GAGGAAATGC GCAACTTGTG GGACAGTGAG
GTCCCCGGGG CCTGCGACAG CTCTCTGCTT ATCGCCGAGC GCGTCGAGTC CTACGACGAT
GTTTTCAAGT TTGTGGACCG GATGCCGCGG TACCCTGTCC CCGCGGGTGA GACGCAGCTG
TCCTGGCTGC GCAAGGAAAT CGACCGCGGA CTGACGTGGC GCTTCCCTGC GGGCGTGCCC
GCCGATGTGG TCGAGCGCGT CGACTACGAG GTCGGCGTGA TCGACAAGAT GGGTTTCCCG
GCTTACTTCC TGGTCGTCGC CGACATCTGC AAGTTCGCGA GAGATCGGGG CATCGGGCTG
GGTCCGGGGC GTGGATCGGC GACCGGGTCG ATGATCGCCT ACATCCTGGG TATAACCGAG
CTGAACCCGA TCGAGCACGC GCTGATCTTC GAGCGGTTCC TCAACCCGGA GCGGATCAGC
CCGCCGGATA TCGACCTGGA CTTCGACGAA CGTCGACGCG GTGAGGTGAT CCGCTACATC
ACCGAGCAGT ACGGCGAGGA CCGCGTCGCG CAGATCAACA CCTTCGGCAC GATCAAGGCC
AAGGCCGCGA TCAAGGACTC GTGCCGGGTG CTGGGTTACG ACTACGCCCT CGGCGACAAG
ATCTCGAAGG CGATGCCGCC GGATGTGATG GGCCAGGGCA TCCCGCTGGC CGGGATCTTC
GACCCGAACC ATGAGCGGTA CGGTGAGGCC GCCGAGGTCC GGGCCCAGTA CGAGACCGAC
ACCAAGGTTC GGAAGGTCAT CGACACCGCG CGGGGGCTGG AGGGGCTCAC CCGCGGCACT
GGTGTCCACG CCGCCGGGGT CATCCTGTGC TCGGAGCCGC TGCTGGATGT CCTGCCGATC
CATCGCCGCG ACAATGACGG CGCGATCATC ACCGGGTTCC CGTTTCCTCA GTGCGAGGAG
ATGGGCCTGC TCAAGATGGA CTGCCTCGGG CTGCGGAACC TCACCGTCAT CGGTGACGCG
ATCGAGGCCG TCAAGCGTAA CCGCAACGTC GACATCGACC TGTCCACCCT CCCCTTGGAG
GACGCCAAGG CCTTCGAGTT GCTCGCCCGC GGCGACACGC TCGGGGTGTT CCAGCTCGAC
GGCGGACCGA TGCGTAACCT GCTGCGCCTG ATGGCACCGA CGAAGTTCGG CGACATCGCC
GCCGTGCTCG CGCTCTACCG GCCCGGCCCG ATGGCGGCCA ACAGCCACAT CGAGTACGCA
GACCGCAAGA ACGGCCGCAA GGAGATCCTC CCGATCCATC CCGAGCTCGC CGAGGCGCTG
GAGCCCATCC TCGGCGAGAC CTACCACCTG GTGGTCTACC AGGAGCAGGT CATGGCCATC
GCCCGGGAGC TGGCCGGCTA CAGCCTCGGC GGCGCCGACC TGCTGCGCCG CGCGATGGGT
AAGAAGAAGA AGGAGATCCT GGACAAGGAG TTCGCTCGGT TCTCCGCCGG GATGAAGGAG
AGGGGCTACA CCGACGCGGC GGTCCAGGCG CTGTGGGACG TGCTTGTCCC GTTCTCCGGC
TACGGGTTCA ACAAGTCCCA CACCGCTGGG TACGGTGTCG TGTCGTTCTG GACGGCCTAC
CTCAAGGCCA ACTACCCGGC CGAGTTCATG GCGGCGCTGC TGACGTCCGT CGGTGACGAC
AAGGACAAGA TGGCGGTGTA CCTGGCGGAG ACCCGCCGGA TGGGTATCCA GGTCCTGCCC
CCCGACGTCA ACGAGTCGGA CCTGCGGTTC GGTGCGGTCG GGGACTCGAT CCGCTTCGGC
CTGGGGGCCG TCCGTAATGT GGGGGAGAAC GTCGTCGCCT CGATAGCCGC CGCCCGGCGG
AGGAAGGGCG CGTATGAGTC GTTCGCCGAC TTCCTGCAGA AGGTCGACAT CGGGGTGTGC
AACAAGCGCA CCATCGACTC GCTGATCAAG GCCGGGGCGT TCGACTCGCT GGGCCATCAC
CGCCGGGTGC TGGTCAACGT ACACGAGAAC GCGATCGACG CGGTGATCAT CACGAAGCGG
GCCGAGGCCA TCGGCCAGTT CGACCTGTTC GGCGACGGGG GCGCCGGCGA GGAGGAGGAG
AGCCCGGGGC TAGGACTCGA CCTGGACCTG TCCGGCCCCG AATGGCCGAA GAAGGAGCTG
CTGGCTCAGG AACGCGACAT GCTCGGCCTG TACGTGTCCT CGCATCCGCT CGAAGGCGCC
GAGCGGGCGC TGGATCGCCA CCGCGACACC CGGATCGTCG ACCTCGCCGA GGCCAACGAC
GGGACGACCG TACAGATCGC CGGTATCATC AGCAAGATCG ACCGGCGGAT CAACAAGAAC
ACCGCCAAGG CGTGGGCGAT CGTGACCGTC GAGGATCTCG ATGCCTCCGT CGAGGTGCTG
TTCTTCCCCC AGTCCTACGA GGTCCATTCG TACGCGCTCG CGACCGACGC GGTCATCTCG
GTGCGGGGCA GGATCAACGA GCGGGAGGGG TCGGTCTCGC TGTTCGCCCA GGACCTGACG
GTGGTCGATG TCGCGACGCA CGTGAACGGG CCGCCGGTCG TCATCACTCT GCCGTCACAC
AAGATCACAC CACCGCTGGT CGATGACCTC AAACTCGTCC TGACGACCCA TCCCGGGACA
ACTCCGGTGC ACCTGCGTCT CGAAGGCCCG CAGAACACGC ATCTGCTCCT GCTCGAACTC
CAGGTGCAGG CGAGTAGTTC GCTGCTCGGG GACCTCAAGG CCCTGCTGGG GGCGCTGTTG
CATTGGGCGG AGGCGGGGCG TGGTGGAGCC GTCTCGGGTG GGCGGTCAGA TGGCCAGGGT
GAGTTCGGTG AAGGCAGCCG TCAGCCGGTT CCGGAGGTCG GCATCGACGC CCAGTTCGTA
GTGGCCGCGG CGGATGTTCT GCACGAACGC GTGCCCGGAG CCGATGATCT GTGCGGAGCG
GAGGCGTTTC AGGCCGCGCA TCGGCCGTAG
 
Protein sequence
MPTDSFVHLH VHTEYSMLDG AAKTGLLFKE AAKLGMPAVG MTDHGNMFGA YEFYQGAKSA 
GVKPIIGIEA YLAPESRHHK RPVLWGERSQ RDVDPAGEGG DVSGGGAYTH MTMLAANAAG
LRNLFRLSSI ASIEGYYRKP RMDHELVSQY SEGIIATTGC PSGEVQTRLR LGQFDKALAA
AATYQEVFGA DNFFLELMDH GLPIERSVRQ GLLDIGDKLG LRPLATNDSH YVTQDQAGSH
EVLLCVGTGK KLDDPTRFKF DGSGYYLKSS EEMRNLWDSE VPGACDSSLL IAERVESYDD
VFKFVDRMPR YPVPAGETQL SWLRKEIDRG LTWRFPAGVP ADVVERVDYE VGVIDKMGFP
AYFLVVADIC KFARDRGIGL GPGRGSATGS MIAYILGITE LNPIEHALIF ERFLNPERIS
PPDIDLDFDE RRRGEVIRYI TEQYGEDRVA QINTFGTIKA KAAIKDSCRV LGYDYALGDK
ISKAMPPDVM GQGIPLAGIF DPNHERYGEA AEVRAQYETD TKVRKVIDTA RGLEGLTRGT
GVHAAGVILC SEPLLDVLPI HRRDNDGAII TGFPFPQCEE MGLLKMDCLG LRNLTVIGDA
IEAVKRNRNV DIDLSTLPLE DAKAFELLAR GDTLGVFQLD GGPMRNLLRL MAPTKFGDIA
AVLALYRPGP MAANSHIEYA DRKNGRKEIL PIHPELAEAL EPILGETYHL VVYQEQVMAI
ARELAGYSLG GADLLRRAMG KKKKEILDKE FARFSAGMKE RGYTDAAVQA LWDVLVPFSG
YGFNKSHTAG YGVVSFWTAY LKANYPAEFM AALLTSVGDD KDKMAVYLAE TRRMGIQVLP
PDVNESDLRF GAVGDSIRFG LGAVRNVGEN VVASIAAARR RKGAYESFAD FLQKVDIGVC
NKRTIDSLIK AGAFDSLGHH RRVLVNVHEN AIDAVIITKR AEAIGQFDLF GDGGAGEEEE
SPGLGLDLDL SGPEWPKKEL LAQERDMLGL YVSSHPLEGA ERALDRHRDT RIVDLAEAND
GTTVQIAGII SKIDRRINKN TAKAWAIVTV EDLDASVEVL FFPQSYEVHS YALATDAVIS
VRGRINEREG SVSLFAQDLT VVDVATHVNG PPVVITLPSH KITPPLVDDL KLVLTTHPGT
TPVHLRLEGP QNTHLLLLEL QVQASSSLLG DLKALLGALL HWAEAGRGGA VSGGRSDGQG
EFGEGSRQPV PEVGIDAQFV VAAADVLHER VPGADDLCGA EAFQAAHRP