Gene Francci3_1225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1225 
Symbol 
ID3902970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1464459 
End bp1467635 
Gene Length3177 bp 
Protein Length1058 aa 
Translation table11 
GC content76% 
IMG OID637878558 
Productputative exonuclease 
Protein accessionYP_480332 
Protein GI86739932 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCCCC ACCACCTGAC CCTGGCGGCG TTCGGCGCCT TCCCCGGCAC CGTCGAGATC 
GATTTCGACG TCCTCGGCAG CGGCGGCCTG CTCCTGCTCT GCGGGGAGAC CGGGGGCGGC
AAGACGACGC TGCTCGACGC GGTCGGGTTC GCGCTGTTCG GCCGGGTGCC GGGGATGCGC
GGGGAGGTCT CCGGGCCGCC GGACCTGCGC TCGCACCACG CCGCGGCATC CCTACGGCCG
GAGGTGACGC TGGAGTTCAC TGTGGCGGCG GGACGTTTCC GGATCACCCG CGGCCCCGCC
TGGGACAAGC CGAAACGCGG CGGCGGGACG ACCCGCGCCC ATCCGACGGC GCGGCTGGAA
CGGTTCGACG GCGCCGGCGG CTGGGAGACG GTCGCCACCC GCATGGAGGA CGTCGGCCAT
GAGATCGACC TGCTCGTCGG CATGAGCGAC AAGCAGTTCT TCCAGGTCAT CATGCTCCCG
CAGGGACGCT TCGCCGACTT CCTCCAAGCC GACCACGGTG CGCGCGAGAA GCTCCTGAAG
CGGCTGTTCC ACGTCAACCG CTTCGAATAC GCCGAGCAGT GGCTGCTGGA CCAGGCAAAG
ATCGCCGCCG AACGGCTGGC GCTGGCCCGC GCCGAGCTCG ACCGGGTGAC CGCCCGGGTC
TCGCAGGTGG CGGCCGTCGA CGAGCCGGAG GATCCGACGG CTGACGCCGG CTGGGCCTCG
GACCTCGCCA GGACCGCCGC GGCGGCGGCC ATCTCGGCCG ACGAGGCGGC CGGCGCGGCG
GCAGCGCGGC GCACCGCGGC CGAGGAGGCC TTGGACCAGG CACGCGACCT CGCCCGACGC
GTCGGGCAGC GCCGGGTACT GGCCGCCCGG CAGGAGGAGC TCGCGGCACA GGCGCCCCGG
ATCGAGCTGC TCGCCACCGA GCTGGACGCG GCCCGCCGGG CCGCCGTCGT CGCACCCGCC
CTGGGCGAGG TCCACCGCCG CGCCGCCGAG GTCCACCGGG CCGAGGCCGC CGAACGCGAC
GCCCGGGACC GGCTCGGACG CCATCCATCC GGGCTCCTTC CCGGGGAACC CGCCCCCTCG
GGACCGCTGC CCGAGGAGGG CACCGGGAGG CCTGCCGAGG CACCTGCCGA AGCACCTGCC
GAGGAACTCG CCCGCCTGGC CCGCCTCGCC CACACCGAGA CCGGGCGACT GGGAACGTTG
GCCGACACCC TCGCCGCCGC CGAGCGGGAC GCCGAGGAGG CCGCCGGGGC CGACCAGGAT
ACGGCCGCCT ACACCCGGTC CGCCGCCGAA CTCGCCGAGG CGATCAGCGT CACGCTGCCC
CGCGCCCGCG TCGCGGCCGA GGCCCGGGTG GAGGTCGCCC GGCGGGCCGG CGCCGCTCTG
CCCGGCCTGG TCGAACGGGC CCGGTGGGCC AAGGAGCTGG CCTCCGCCGT CCGGGAGGGG
CGGCAGGTGC GGACCGTGGC CGACGAGGCC GAACGAGACG CATCCGCGGC CCGGACCCAC
GCATCCGACC TCCGGCAGCA GCGATTCGAC GCCATCACCG CCGAACTCGC CGCCGCGCTT
GTCCATGACA CGCCCTGTCC GGTCTGCGGG GCCCTGGAGC ATCCCGACCC GGCCGAGACC
CGGGCCGACC ATGTGAGCAA GGACGCCGAG ACCGCGGCCG GGCAGGAGGC GGACCGGCTC
GCCGACGCCG CGACGAGGGC GAGCCGGGCG GTGGCCCACT GGGAGTCCCG GGTCCGGGCG
CTGCACGCCG ACCTGGTCGG CCCAACGGAC CCGGACCACA GCGCGAACCC GGACCACAGC
GCGAACCCGG ACCACAGCGC GAACCCGGAC CACAGCGCGA ACCCGGACGA CCGCGTCGAG
GCGGCCTTCG CCGAGATCCG CGCGCTTCCG GTCGCGACGC TGCTGGGCGG CTCCGGGGCG
CCGGTCGCGG ATCGGCTCGA TGAACTCGCC GCCGTGCTGA CCCACGCGGT CCGCGCCCGG
ACCCGGACGG CGAAGAAGCT CGCCGCGGCG GAGGCCGCGT TGCGCGAGGT CCACGAGAAC
GAGAAGGAGA CCGCGGCCCG CCATTCCGCC GCACGGACCG CGGCGCAGGC GGCCCGGGAA
CGCGCGGCGG ACGCCCGGGA CCGCGCCGCC CGACGGCTCG CCGGCGTTCC GGCGGAGCTA
TGCGACCCCG ACGCCCTGGC CGCGCGCCGC CGTGCCGTCA CCGCCCTCGC CGCCGATCAC
GAGGCCGCGC AGGCCGCGGC CCTGGCTGCC GAGCAGGCCC GGGCCGAACA CATCCGCGCC
GGGACGGCCG CCCTCGACCA GGCGCGGCAG GCGGGGTTCT CCGACCTGGA CGACGCCGCC
GAGGCCGTGC GGGACTCCGA CTGGATGCGC CGCGCCGCGG ACGAGGCCCG GGCCCATCGG
GACGAGCTCG TCGCCGTCGG GGCCAGGCTG GCGGGCGAGG ATCTCGCCGT CGATCCGGAC
ACCGAGGTCC CGCTGGCCGA CCACGAGACG GCCGTGACCG ACGCCCGCGA GGTCCACGAG
AGCGCGCTCG CCACGGCCGC ACGCGCCCGG GAACGGGCCG AACGGCTGGC CTCGCTGGTC
ACGGAGTTCA CCGAGAAGCT CACTACCCTG GATCCGCTGC GGGAGGCGGC GGACGAGCTG
CGGGGCCTCG CCGATCTGGC CGCCGGGCGG GGTGCCAACA CCGAGCGCAT GCCGCTGTCC
AGCTTCGTGC TGGCGGCTCG GTTGGAAGAG GTGGCCGCCG CCGCCAGCCA TCGGCTCGCG
GCGATGAGCA GCGGTCGGTT CACCCTGGTG CACGACGCGG GGGAAAGCCG CGACAAGCGT
CGGCGCGCCG GCCTCGGGCT GCTGGTCGAC GACGCCTGGA CCGGCCGGCG GCGCGACACC
GCCACCCTGT CCGGCGGGGA GACCTTCCAG GCGGCGCTGT CACTGGCCCT CGGGCTCGCC
GACGTCGTGA CGGCCGAGGC GGGAGGCCGG CGGATGGACG CGCTGTTCAT CGACGAGGGT
TTCGGCACGC TCGACCCGGA CAGCCTCGAC GAGGTGATGA CCGTTCTCGA CGAACTGCGT
TCCGGCGGCC GGCTCGTCGG CGTCGTCAGC CATGTCACCG AGCTGCGCCA GCGCATCCCG
AACCAAATCC GGGTCGTCAA AGGGGTCGGC GGCAGCCGGG TCGAGACCAC GTCCTGA
 
Protein sequence
MRPHHLTLAA FGAFPGTVEI DFDVLGSGGL LLLCGETGGG KTTLLDAVGF ALFGRVPGMR 
GEVSGPPDLR SHHAAASLRP EVTLEFTVAA GRFRITRGPA WDKPKRGGGT TRAHPTARLE
RFDGAGGWET VATRMEDVGH EIDLLVGMSD KQFFQVIMLP QGRFADFLQA DHGAREKLLK
RLFHVNRFEY AEQWLLDQAK IAAERLALAR AELDRVTARV SQVAAVDEPE DPTADAGWAS
DLARTAAAAA ISADEAAGAA AARRTAAEEA LDQARDLARR VGQRRVLAAR QEELAAQAPR
IELLATELDA ARRAAVVAPA LGEVHRRAAE VHRAEAAERD ARDRLGRHPS GLLPGEPAPS
GPLPEEGTGR PAEAPAEAPA EELARLARLA HTETGRLGTL ADTLAAAERD AEEAAGADQD
TAAYTRSAAE LAEAISVTLP RARVAAEARV EVARRAGAAL PGLVERARWA KELASAVREG
RQVRTVADEA ERDASAARTH ASDLRQQRFD AITAELAAAL VHDTPCPVCG ALEHPDPAET
RADHVSKDAE TAAGQEADRL ADAATRASRA VAHWESRVRA LHADLVGPTD PDHSANPDHS
ANPDHSANPD HSANPDDRVE AAFAEIRALP VATLLGGSGA PVADRLDELA AVLTHAVRAR
TRTAKKLAAA EAALREVHEN EKETAARHSA ARTAAQAARE RAADARDRAA RRLAGVPAEL
CDPDALAARR RAVTALAADH EAAQAAALAA EQARAEHIRA GTAALDQARQ AGFSDLDDAA
EAVRDSDWMR RAADEARAHR DELVAVGARL AGEDLAVDPD TEVPLADHET AVTDAREVHE
SALATAARAR ERAERLASLV TEFTEKLTTL DPLREAADEL RGLADLAAGR GANTERMPLS
SFVLAARLEE VAAAASHRLA AMSSGRFTLV HDAGESRDKR RRAGLGLLVD DAWTGRRRDT
ATLSGGETFQ AALSLALGLA DVVTAEAGGR RMDALFIDEG FGTLDPDSLD EVMTVLDELR
SGGRLVGVVS HVTELRQRIP NQIRVVKGVG GSRVETTS