Gene Francci3_4231 details


Gene Information

Locus tag: Francci3_4231
Symbol:
ID: 3907197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5047526
End bp: 5049781
Gene Length: 2256 bp
Protein Length: 751 aa
Translation table: 11
GC content: 67%
IMG OID: 637881557
Product: Terpene synthase, metal-binding
Protein accession: YP_483306
Protein GI: 86742906
COG category:
COG ID:
TIGRFAM ID:
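
The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence listed in this record ends in TAG). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the record above.
start_bp = 5047526
end_bp = 5049781
gene_length_bp = 2256
protein_length_aa = 751


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
coordinates_match = span_length(start_bp, end_bp) == gene_length_bp
protein_matches = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_match, protein_matches)
```

Under these assumptions both checks agree with the record: the span covers 2256 bp, which is 752 codons, or 751 residues once the stop codon is excluded.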


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCCTT TCACGCTACC GGAGTTCTAC GTGCCCTATC CGGCACGGCT CAATCCGAAC 
CTGGAACAGG CCCGCGTGCA CAGCAGGGCT TGGGCCGACG AGATGGAGAT GATCGACTCT
CCGCAGCACG GAACCGCGAT CTGGACCGAG GCCGACTTCG ACGCACACGA CTACGCCCTG
CTCTGTGCAT ACACCCATCC GGACTCCGTC AGCCGAAAAC TCGATCTGGT CACCGACTGG
TATGTCTGGG TTTTCTACTT CGACGACCAC TTCCTGGAGC TGTACAAGCG GTCCCACGAC
ATGGCCGGCG CCCGGGCCTA CCTCGACCGC CTGCCCGCCT TCATGCCCGT CGACGGTGAG
ATCACCGAGA CGCCGACCAA CCCGGTGGAA CGCGGACTCG CCGACCTGTG GACCCGGACC
GTCCCGGAGC GTTCCGCCGA CTGGCGGCGG CGGTTCGCGG TCAGCACCAA GAACCTGCTG
GACGAGTCCC TGTGGGAACT AGCCAACATC AACGCCGGCA GGCTCGCCAA CCCGATCGAA
TACGTCGAGA TGCGGCGCAA GGTCGGCGGC GCGCCCTGGT CGGCGAACCT CGTCGAGCAC
GCCGCCGACG CCGAGGTGCC GGCCCAGGTC GCGGCGACCC GGCCGCTGCA GGTACTGCGG
GACACCTTCG CCGACGCCGT GCATCTGCGC AACGACCTGT TCTCCTACCA GCGCGAGGTC
GAAGAGGAGG GCGAGCTCAG CAACGGCGTG CTCGTCATCG AACGTTTCCT CGGCTGTGGA
ACCCAGGAGG CCGCCGACAC GGTCAACGAC CTGCTCACCT CGCGGCTGCA CCAGTTCGAG
CACACCGCGG TCACCGAACT GCCGGCGGTC CTCGAGGAGC ACGGTGTCGA CCCGGGGAGC
CGGCTGGAGG TCCTGGCGTA CGTGAAGGGT CTGCAGGACT GGCAGTCAGG GGGCCACGAA
TGGCACCTGC GCTCGAGCCG GTACATGAAC CGGGCCGTGG CGCCTGAGAG CGGCGAGCTC
AGCGGACTCC TCGGACTCAC CGGTCTCGGT ACCTCGGCGG CCCGGATTGT GCCGTCCCTC
GTCACCACCA CACCGCGCCG GATCCGCTCC TTCACGCACA TCCCGCACCA GATCGTCGGG
CCGCTGCGCC ATCCCGACTT CTGCATGCCC TTCTCGACCG GGCAGAGCCC ACATCTGGAC
GCCTCCCGCC GGGAAAACAT CATCTGGGCC CGGGCGGTAG GGATGCTGGA CCCGATTCCG
GGCATCTGGG ACGAACACAA ACTGCGGGCG TTCGACTTCG CGCTGTGCTC GGCCGGAATC
CATCCGGACG CGACCCTGCC GGAACTGAAT CTGACGACGG ACTGGCTCAC CTGGGGGACC
TACGCCGACG ACTACTACCC GGTAATCTTC GGCCGCACCC GGGACATCCT CGGGGCCAAG
GTGTGCAACG CCCGGCTGTC GGAGTTCATG CCGCTGGACT CCCCCGTCAC CGCGGTGCCG
GCCAACGCGT TGGAACGCGG TCTGGCCGAC CTGTGGACGC GCACCACCGA GACCATGGCG
CCAGGCGCGC GCGAGACGTT CCGCGGCACC GTCGAGGTTA TGATCGACAG CTGGCTGTGG
GAGCTGGCAA ACCAGGCCCA GAATCGCATT CCCGACCCGA TCGACTACAT TGAGATGCGC
CGGGCGACCT TCGGTTCGGA TCTGACGATG AGCCTGGCCC GGCTGGCCCG GCTGGCCCAG
GAACAGACCG TGCCACCCGA GATCTACCGC ACCCGGCCGA TCCAGGCCTT GGAGAACGCG
GCCGCGGACT ACGCCTGTCT CCTCAACGAC GTCTTCTCCT ATCAGAAGGA GATCCAGTTC
GAGGGTGAGA TCCACAACTG TGTCCTCGTC GTCGAGAACT TCCTCGACTG CGACCGGGAG
CGTGCCCTCG CGGTGGTCAA CGATCTGATG ACCTCCCGGA TACGCCAGTT CGAGCACATC
GTGGCGCACG AGCTTCCGGC GCTGTTCGAC AGCTTCGCCC TGGACGCGTC GGCGCGGCAG
GCGCTGCTGG GCTACGCGCG GGAACTCCAG AACTGGCTGG CCGGGATCCT CCGCTGGCAT
GAGGGCACGC ATCGGTACGA GGAGTCCGAA CTGCGGTACC ACCCAGCGGC GGGAGTACGG
CCCTTCGGCG GCCCCACGGG CCTGGGCACC TCGTCCGCCC ACGTCCGCCC ACGTCCGGCC
GCGGCGGCCG GCGCGGCCGG CGATAGTGAA ATGTAG
 
Protein sequence
MQPFTLPEFY VPYPARLNPN LEQARVHSRA WADEMEMIDS PQHGTAIWTE ADFDAHDYAL 
LCAYTHPDSV SRKLDLVTDW YVWVFYFDDH FLELYKRSHD MAGARAYLDR LPAFMPVDGE
ITETPTNPVE RGLADLWTRT VPERSADWRR RFAVSTKNLL DESLWELANI NAGRLANPIE
YVEMRRKVGG APWSANLVEH AADAEVPAQV AATRPLQVLR DTFADAVHLR NDLFSYQREV
EEEGELSNGV LVIERFLGCG TQEAADTVND LLTSRLHQFE HTAVTELPAV LEEHGVDPGS
RLEVLAYVKG LQDWQSGGHE WHLRSSRYMN RAVAPESGEL SGLLGLTGLG TSAARIVPSL
VTTTPRRIRS FTHIPHQIVG PLRHPDFCMP FSTGQSPHLD ASRRENIIWA RAVGMLDPIP
GIWDEHKLRA FDFALCSAGI HPDATLPELN LTTDWLTWGT YADDYYPVIF GRTRDILGAK
VCNARLSEFM PLDSPVTAVP ANALERGLAD LWTRTTETMA PGARETFRGT VEVMIDSWLW
ELANQAQNRI PDPIDYIEMR RATFGSDLTM SLARLARLAQ EQTVPPEIYR TRPIQALENA
AADYACLLND VFSYQKEIQF EGEIHNCVLV VENFLDCDRE RALAVVNDLM TSRIRQFEHI
VAHELPALFD SFALDASARQ ALLGYARELQ NWLAGILRWH EGTHRYEESE LRYHPAAGVR
PFGGPTGLGT SSAHVRPRPA AAAGAAGDSE M