Gene Francci3_3797 details

Gene Information

Locus tag: Francci3_3797
Symbol:
ID: 3906082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4550253
End bp: 4551740
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 75%
IMG OID: 637881123
Product: hypothetical protein
Protein accession: YP_482876
Protein GI: 86742476
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
              [T] Signal transduction mechanisms
COG ID: [COG2508] Regulator of polyketide synthase expression
TIGRFAM ID:
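
The length fields in this record follow from the coordinates. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly but which agrees with the listed gene length. The variable names are made up for this example.

```python
# Coordinates copied from the record above.
start_bp = 4550253
end_bp = 4551740

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop
# codon and does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1488 bp
print(protein_length, "aa")   # 495 aa
```

Both values match the Gene Length and Protein Length fields listed above.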


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00410976
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCCACGG GAGGCCGTGC CCCGACCCGC CCGGGACCGA TTCCGCCGGC CGTCGGCGAC 
GACCACGGGC GGCCCCGCCC AGACGACGAG GTGCGGGAGA TTGCGACGGT CGGGCGGCAA
ACTGTCCTCA TGGGACAGTT TCCAGCCGGG CTGAGCGAGA TCCTCGCTGC CGAGTTCGAC
GCTCTGGGTG AAGAGATTAT CGCCGCGATC GCCCGGGAGG TTCCCGCCTA CGCGCGCCCG
CTGGAGGGCA AGTTCGGCCA TGGGGTGCGC CGCGGAGTCG ACGAGGCGTT GTTCCGCTTC
CTCAGCCTGG TCGAGGCCGG TCCCCACTGC ACTGTCGACC TCGCCGCCAG CCGGGAGGTT
TATGTCCGTC TCGGCCGCGG CGAGGTGTAC GCGGGACGGT CGTTGGACAA CCTGCTGAGC
GCCTACCGGG TGGGCGCCCG CGTCTCCTGG CGGCGCCTCG GGGAGGCGGC GGCCCGCCGC
GGCGGGCTGG ACGGCCCGGC GCTGGTGTCC CTCGCCGAGA TGATGTTCGC CTACATCGAC
GGCATCTCCG CGGCCTCCGC CGAGGGCTAC GCCTCCGAGC AGTACACCGC GGCCGGTGAA
CTGGAGCGGC TGTGGGACCG GCTGGGCGAG ATGCTGCTGT CCGGCGCGGC CGGCGGGGCG
ATCGCGCAGG TCGCCCGTTC CGGCAACGTG CGGCTGCCCG GCCGGCTGGC CGCCGTGCTC
GTCCCCGCCC CGACCGGCTC CGCGGCCACC CACCCGGCTG ACGGCGAAGC CGACCACGAG
GCCGACCACG AGGCCGACCA CGAGGCCGAC CCCTGGGCAG GCTCGCTCCC CTCCCGGTTG
CCCTCGGGCT GCCCCCGCGC CGTGCAGGGG GCAGACATCT GGGTGTTCGT CGGCTCAACC
GAGCGGGCGG CCACCCGGGC GGCACTGGCG AAACAGCTCG CCGGGCTCGC CGCGGTGGTG
GGACCGGCCG TGCCGTGGGG TCAGGCGGCG GCGAGCGCGG CGCGGGCGAG GTTCGCCTGC
GATGCCCGCA GCGCCGGGCG GCTTCGCGGT ATCGCCGCGG CCGACCCCCT GTTCACCGAC
GAACATCTCA GCGCCCTGCT GCTGGCCAGC GACCCCGGGC TCATCACCGA CCTCGCGTCC
CGCCGGCTCG CCCCCCTTGA CGGGCTGCCC GACCGGACCA GGGAACGGCT GGCCGAGACC
CTCCTGCACT GGCTGACCCT GCGGGGGCAG CGCGGGCTGA TCGCCGAGCG GTTGCACATC
CATCCGCAGA CCGTCCGCTA CCGGGTCAAC CAGCTTCGCG AGCTGTTCGG ACCATGTCTG
GAGGACCCTG ACACCCGCTT CGATCTCGAA CTGGTGCTGC GCGCCGGGGG CGCCGAGGAC
GTGGCCACCG ACCCGGTGAG CGCCGCGGTC CGCGAGGACA CGGACGAGGG CCGGGGCCCG
GGCTCCCGCC GTCCCGCGGC GGTGCGGGGT CGGCCACTCG GGCCGTGA
 
Protein sequence
MATGGRAPTR PGPIPPAVGD DHGRPRPDDE VREIATVGRQ TVLMGQFPAG LSEILAAEFD 
ALGEEIIAAI AREVPAYARP LEGKFGHGVR RGVDEALFRF LSLVEAGPHC TVDLAASREV
YVRLGRGEVY AGRSLDNLLS AYRVGARVSW RRLGEAAARR GGLDGPALVS LAEMMFAYID
GISAASAEGY ASEQYTAAGE LERLWDRLGE MLLSGAAGGA IAQVARSGNV RLPGRLAAVL
VPAPTGSAAT HPADGEADHE ADHEADHEAD PWAGSLPSRL PSGCPRAVQG ADIWVFVGST
ERAATRAALA KQLAGLAAVV GPAVPWGQAA ASAARARFAC DARSAGRLRG IAAADPLFTD
EHLSALLLAS DPGLITDLAS RRLAPLDGLP DRTRERLAET LLHWLTLRGQ RGLIAERLHI
HPQTVRYRVN QLRELFGPCL EDPDTRFDLE LVLRAGGAED VATDPVSAAV REDTDEGRGP
GSRRPAAVRG RPLGP
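
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch of both calculations in plain Python, not the method used to build this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Table 11 shares its codon-to-amino-acid assignments with the standard code and differs in which codons may initiate translation; this sketch ignores alternative start codons, which is harmless here because the gene begins with ATG.

```python
# Codon assignments in TCAG order (shared by the standard code and table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGCCACGG GAGGCCGTGC CCCGACCCGC"
print(translate(first_blocks))   # MATGGRAPTR
print(round(gc_content(first_blocks), 2))
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared with the GC content field.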