Gene Francci3_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2033 
Symbol 
ID3906750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2390630 
End bp2393725 
Gene Length3096 bp 
Protein Length1031 aa 
Translation table11 
GC content72% 
IMG OID637879370 
Productlantibiotic dehydratase-like 
Protein accessionYP_481136 
Protein GI86740736 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.423471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.294126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATCAGC ATCTCGACTC CCTGGTGGTG CGTGCCGCGG TGAACCCGCC CGTGGACGTG 
GCCGAGCGAT GGCCCGACCT GGTCGGACCG GCGGCCACAC CAGAGTCCTG GCGCCTCTGG
TTACGGGAGG TGATGCAGAA TCGCGCCTTC GAGATGGCGC TGGAGCAGGC CAGCCCTGTG
CTCGCTCGCC GCGTTCGGGA GATCCGCGAC GGACGCCAAG TGCCCCAGGC CGGGGTGCGG
CGGGCAGTGC TGTCGGTGAT GCGTTACCGG CTTCGCGCGT CGGGACGGGC GACACCGTTC
GGCCTGTTCG CGGGGGTCGC GCCGGTGCGT GTCGCTGACC ACGCCTCGGT GCGCGCAGGC
GTGGCGCACC GTGCCGTCGC GCGGGTCGAG GCTGGCTGGC TCTGCGCGGT GGTCGACCGG
TTGGAAGCCG ATCCGGTCGT CCGAGCGCGG CTGGATGTGG TGGCGAACAA CCTCGTGTTC
GAGCGGGACG GCCATCTGGT GCTGGAGCAC CGTGCCGCAG GAGGCACCGA CGGTGTTCCT
ACGCGCGTGC AGGTGCGGGC GCACGCACCG GTCCGCGCCG CGATGGGTAT CGGCCGCCGC
TCGGTGCGGT TCTCCGACCT CGCGGCCAGG CTCGCGTCCG AGTTCGCTCC GGTCCCTGCC
GACGTCGTCG ACAGACTTCT GGCCGATCTG GTGGCGCAGC GGTTCCTGCT CACGAACCTG
CGGCCTCCGA TGACGGCGAG CGATCCTCTC CGTCATGTCG TGGGTGTGCT CCAGACGACC
CTAAATGGCG GGCCGGTCAC CGTCGAGGCG GCCTGGGTCG CCGAACGCCT GAACGGGATT
GTCGATGGGC TGCGGAGCCA CGACAGCGCG TCGAGTTGGA CCGTCGCGCG CAGCAGCCGC
GAGCGCCTCG GCGAGGACAT GGCTGAGGTC CACCCTTCGG CGGAGCCGGC ACTCCGGGTT
GATCTGCGGA TGGACTGGGC GCTGGCTGTG CCCCGCGCGG TCGCGGTCGA GGCCGCAGCG
GCGGCGGGCG TGCTGGTGCG CCTGGCGCGG CGCTCGACGC TCAGCTCCGG CTGGGACGCC
TGGCATGGCC GGTTCCTCGA ACGGTACGGG CCGTGCGCTC TGGTTCCCGT TTTCGACGCG
GTCGACCCCG AGATCGGACT GGGTTATCCC GCCGGCTACG CCGGCAGTCC CGCACCCGCG
GGCGTGACGT TCACCGATCG GGACGCGAAG CTGTTGGCGT TGGCGCAGAA CGCCGCGCTG
CGCCGCGAAC GGGAGATCGT CCTCGACGAC AGGATGATCA GCGACCTTAC GGTCGTCGAC
CAGGACGCCC ACGTTCAGCC CACCACGGAA CTCACCCTGC GCGTCCACGC CACGAGCACC
CGCTCGCTCG ACGATGGCGC GTTCACCCTC GCGGTCGTCG GGGTGTCACG CAGAGCCGGC
ACCACGACCG GACGACTCCT CGACCTCTTC GACGCGAAGG ACAGTGCGCG GATGCGCGCG
CTGTACGCGG GTCTTCCCAC CGCGAGCCGC GACGCGCTGA GCGTGCAGAT CTCCGCGCCC
GCTCGCTACA CGAACACCGA CACGGTGGGA CGCGCCCCCC AGATCATGGC GCACCGGCTC
TCCCTCGGCG AGTACGACGA CAGCGAGGAC GGGGACTCGC TGCCTCTGGA CGACATCGTG
GTGACGGCCG ATGTGCACCG GATCTATCTG CTCTCGCTCT CCCGCCGCCA ACTGGTGGAA
CCGGTGGCGC TCAACGCGGT GGAGCCGGTG CGACGCGCGC ATCCGCTGGC GCGTTTCCTC
GCCGAGGCGC CGGCCGCGCT GAGCGTTCCC TGCGCCCCCT TCGACTGGGG GGTCGCGGCC
CAGCTACCGT TCCTGCCGGC GCTGCGGCAT GGACGAACAG TCCTCTCACC AGCGCGCTGG
CTGCTGGGGA CCGCGGACCT GCCCGGCTCC GCGGCCGGCT GGACGGAGTG GGACCGTGCC
CTGGCCAGCT GGCGGGAGCA GGTCAACCTC CCCTCGGCGG TCTACCTCGG CGACGGTGAC
CAGCGCATCG GACTGGACCT GTCTGAGCCG GCCCACCGGG TACTGCTGCG CTCCCACCTC
GACCGCTCCG GCGCCGCGCT GCTGCGCGCC GCCCCCGACA CCGACGCGGC CGGCTGGGTC
AGCGGGCATG CCCACGAGAT CGTCGTCCCC CTGGCAGCGG TCGCCGCGCC GGCGGCGACA
CCGGGCTGGT TCAGCCGGGC GCAGGTCGTT GGCCGCGATC ATGGGCATCT TCCGGGCTGC
GACGCACGGT TCTCAGTCAA GATCTACGCC GGCCTCGGTT GCCAGGACGA CATCCTCACC
CGCCATCTGC CGCACCTCGT CCACGAGCTG AACGGGCAGC AGGCCGCGGG CGAGGCGCGC
TGGTGGTTCC TGCGCTACCA CGACCCCGAC GACCATCTGC GCCTGCGGCT CGCGGTCACC
GCCGACGGTG TGGCGCCGAC CGCCGACCGG ATCGGTGCCT GGACCCAGCG GCTCCGCCGC
GCCGGCCTGA TCTCCCGCGC GCAGTGGGAC ACCTACTTCC CCGAGACCTC ACGCTTCGGG
GGCACCGCCG CGATGGACGC CGCGGAGGCG TACTTCGCCG CCGACTCAGC GGCGGCCCTC
GCCCAGCTCG CCGCCTCCAG GGAGAAGAGC GGGCCCGACC GTCGCGCGCT GACCGCCGCC
AGCATGCTCG CCATCGTGAC CGGCCTGATC GGCGACACCG CCGAGGCGAC GAGCTGGCTC
ATCAGCCACA CCCGGACGGA GCCGTCCGCC CCAGCCCGCG CGCTGTACCA GCAGACCGTC
GCCCTGGCGA ACCCGGCCGA CCCACGCGCG CTGGCCGCAC AGCCCGGAGG CGAGCACGTC
CTCTCCTGCT GGGCGCACCG GCGCGAGGCA CTCACCGCCT ACCGACACGT CCTCCAGGAA
ACCCGCGCGG AGGCCCCCAC CTCCCTGCTG CCTGATCTTC TGCATCTCCA CCATGTCCGC
ATGGCCGGAG TCAGCCTGGC CGGGGAACGC GCCTGCCTGC ACCTGGCTCG CGCCGCCGCA
CTGAGCTGGA CCGCGCGCAC GAGGAACCCA TCATGA
 
Protein sequence
MYQHLDSLVV RAAVNPPVDV AERWPDLVGP AATPESWRLW LREVMQNRAF EMALEQASPV 
LARRVREIRD GRQVPQAGVR RAVLSVMRYR LRASGRATPF GLFAGVAPVR VADHASVRAG
VAHRAVARVE AGWLCAVVDR LEADPVVRAR LDVVANNLVF ERDGHLVLEH RAAGGTDGVP
TRVQVRAHAP VRAAMGIGRR SVRFSDLAAR LASEFAPVPA DVVDRLLADL VAQRFLLTNL
RPPMTASDPL RHVVGVLQTT LNGGPVTVEA AWVAERLNGI VDGLRSHDSA SSWTVARSSR
ERLGEDMAEV HPSAEPALRV DLRMDWALAV PRAVAVEAAA AAGVLVRLAR RSTLSSGWDA
WHGRFLERYG PCALVPVFDA VDPEIGLGYP AGYAGSPAPA GVTFTDRDAK LLALAQNAAL
RREREIVLDD RMISDLTVVD QDAHVQPTTE LTLRVHATST RSLDDGAFTL AVVGVSRRAG
TTTGRLLDLF DAKDSARMRA LYAGLPTASR DALSVQISAP ARYTNTDTVG RAPQIMAHRL
SLGEYDDSED GDSLPLDDIV VTADVHRIYL LSLSRRQLVE PVALNAVEPV RRAHPLARFL
AEAPAALSVP CAPFDWGVAA QLPFLPALRH GRTVLSPARW LLGTADLPGS AAGWTEWDRA
LASWREQVNL PSAVYLGDGD QRIGLDLSEP AHRVLLRSHL DRSGAALLRA APDTDAAGWV
SGHAHEIVVP LAAVAAPAAT PGWFSRAQVV GRDHGHLPGC DARFSVKIYA GLGCQDDILT
RHLPHLVHEL NGQQAAGEAR WWFLRYHDPD DHLRLRLAVT ADGVAPTADR IGAWTQRLRR
AGLISRAQWD TYFPETSRFG GTAAMDAAEA YFAADSAAAL AQLAASREKS GPDRRALTAA
SMLAIVTGLI GDTAEATSWL ISHTRTEPSA PARALYQQTV ALANPADPRA LAAQPGGEHV
LSCWAHRREA LTAYRHVLQE TRAEAPTSLL PDLLHLHHVR MAGVSLAGER ACLHLARAAA
LSWTARTRNP S