Gene Francci3_1990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1990 
Symbol 
ID3903698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2338343 
End bp2340586 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content70% 
IMG OID637879326 
Productmetallophosphoesterase 
Protein accessionYP_481093 
Protein GI86740693 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTATT GTCGGAGACT AGACTCAGGC ATGCAGATCA TCGGCGCCGT GCTCACCATC 
CACGGCAACG CGCTCGCAGG GATCGCGGTA AGCGACGGGC GGACCTGGAC ATTCACTCGC
GACGACGGCT CGTTCAGCCT CCGCACCGTG CCGGGGCGCC CGGTCTGGGC CCGGCGCCCC
AACGGCTGGA CGGGCCGCTG GTGGCAGCAC CCGGCAGCGG GGGAGCCGGT GCTGTTTCAG
CTGCACCCAA GCCGCACCGG CAGCGATCCA CAACCACCCT CGGCTCCCTC AGGGGAGCTG
CGATTCGCGC ACATCACGGA CACGCATGTC AGCGCGATCG ACGGCGACCC GTCCCAGGCG
GTCGAGCTCG CCGCCCGCTA CGGCGACCAG ACGGACACCA CCAGCGGCCT GCACCACGCG
CTGCGGACCG CAGCCGACCA CGGCGCGGCA TTCGCCGTGA TCACCGGTGA CCTCACCGAC
CACGGCACAC CCGAAGAGTT CAGGCGCGTA CTCGACTCCC TCGCCGCAGC GCCGCTGCCG
GTAGAGATCG TGCCGGGTAA CCACGACCAT TACGGCCACC GCCACCAACC CCACCCCAGC
GACACCCCCC ACGGTGGCGG GTTTCTCGGC GCGGCTACCC TCACCCGCTA CGAGCAGGCC
ATGGGCCCGC GCTGGTGGTC AGCTGACCTG GCCGGCGTGC ACCTCCTCGC CCTGGACTGG
TTCAGCGCCT GGTGCGCGAT CGATGACACC GATCAGCAAC GCTTCATAAT CACCGATCTC
GCCACCCGAA CCCCCGGGCT GCCAGTCGTC GTGCTCACCC ACGACCAGCC CGACCACGAC
ACACTCGAAC TGATCCGCTA CAGTGCCGCA CCCGACAGCC TCCTCGCCGT CCTGTCGGGA
CATTGGCATG CCGACGCGCA GCGCAACGTC GGCGGCTGTC ACCTGCTCAG CACGCCCGCA
GCCAGCTTCG GCGGACTGGA CTGGTCACCA CCGCAGCTCC GCCTCATCAC CCTCACCCCC
GGCTTGAGAA CTATGGATCT ACGGCACGAC ACGATCCCGG CGCTACCGAA ACCGCCACGG
TCACCGACCA CGTCTCGCGC CGATGCCCCG GCCTCGCCCC GCACCACCAG CCACTCGATC
GGCGCCCATC AACATCTGGG CACCCTCGCG ACCATCGCCG GCACCGTCAT CGCACCGAGC
ACCGACACCC ACGGCGCAGG ACACCTGACC CGCCTACACC CCTCCAACAC CGGCAGTAAC
GACAGCCGAG TCGACGTGCT GTGGACGGTG CGCGCCGCAG ACGACCCCAT CACCGGCGTT
CTCGCCGGCC ATGACCAGAT CCTGGCGTGC AGCCATGCGG GCACCCTCAC CGCGCTCGCC
CCAGCCACCG GCGCACCCCA TTGGACCCGA CACCTCCCAC ACCGGCAGCG ACGCCGGCTG
CTAGCCACCC CCATCCTCAC CGCGGCGGGC CGGCTGATCG TCGGTGACGT CGGCGGCGTC
ACCTGCCTGG ACCTCGACAC CGGGGATATC GCCTGGCACC GCGACCAGCT CGGCCAGGTC
GACACCCTGC TCACCTACGG AACCGGTCTG GCCACCGACC GCTTGGCCGT GCTGCCCCTC
GGCGGCCCCA CCCCGGGCCT GACCGCCCTG GACCTGCGCG ACGGCACCAT CACCTGGACC
GATCCGCCCG GCACACCACC GCCCTCCAGC TCGCTGGTCG CCATTGACGG GACCGATGCG
CTGCTGCTCC GTACCGCCGG ACCCACCCTC GAGCGGCTGA ACCTCTCCAC CGGCCAGACT
CGGTGGCGCA CCACCCTGAC CGGCCGCTTC TCCACAGCCG CTCCCCTGGT CACCGACGAG
GCAATCGTGC TCGTCACCGG CGACGGCATC GCGCACCGGC TCGACCCGGA TCACGGCGGC
ATCCTCGACC GCCAGCACCT GCACGGGCTG CGCCCCGCCT ACGGTCCCTA CCGGTCCACT
GGTACCGGCG CGCCCACCAC TGCCGTCCAT ACCCCACTCG GACCGATGAT CGTGCTGCTC
GATGGCAGTA TCTGGCAACT GGACAGCCCC GCTGGTCCGC TGCTGGTCGG CGACGTCGCG
GCGCCCGTCA CCACCCAGCC TGTCCTGCTC GGATCGAACA CCCTCGTCGT GCTCAGCACC
GACGCGGTCG TTCACCTGCT CGACATCAAC GCCACCGCAA CCCGCCCCAT GCTTGCCGGT
CCGGCATCGC GGTCTGCCTC ATGA
 
Protein sequence
MIYCRRLDSG MQIIGAVLTI HGNALAGIAV SDGRTWTFTR DDGSFSLRTV PGRPVWARRP 
NGWTGRWWQH PAAGEPVLFQ LHPSRTGSDP QPPSAPSGEL RFAHITDTHV SAIDGDPSQA
VELAARYGDQ TDTTSGLHHA LRTAADHGAA FAVITGDLTD HGTPEEFRRV LDSLAAAPLP
VEIVPGNHDH YGHRHQPHPS DTPHGGGFLG AATLTRYEQA MGPRWWSADL AGVHLLALDW
FSAWCAIDDT DQQRFIITDL ATRTPGLPVV VLTHDQPDHD TLELIRYSAA PDSLLAVLSG
HWHADAQRNV GGCHLLSTPA ASFGGLDWSP PQLRLITLTP GLRTMDLRHD TIPALPKPPR
SPTTSRADAP ASPRTTSHSI GAHQHLGTLA TIAGTVIAPS TDTHGAGHLT RLHPSNTGSN
DSRVDVLWTV RAADDPITGV LAGHDQILAC SHAGTLTALA PATGAPHWTR HLPHRQRRRL
LATPILTAAG RLIVGDVGGV TCLDLDTGDI AWHRDQLGQV DTLLTYGTGL ATDRLAVLPL
GGPTPGLTAL DLRDGTITWT DPPGTPPPSS SLVAIDGTDA LLLRTAGPTL ERLNLSTGQT
RWRTTLTGRF STAAPLVTDE AIVLVTGDGI AHRLDPDHGG ILDRQHLHGL RPAYGPYRST
GTGAPTTAVH TPLGPMIVLL DGSIWQLDSP AGPLLVGDVA APVTTQPVLL GSNTLVVLST
DAVVHLLDIN ATATRPMLAG PASRSAS