Gene Francci3_3502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3502 
Symbol 
ID3905236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4178830 
End bp4181448 
Gene Length2619 bp 
Protein Length872 aa 
Translation table11 
GC content73% 
IMG OID637880824 
ProductGCN5-related N-acetyltransferase 
Protein accessionYP_482584 
Protein GI86742184 
COG category[C] Energy production and conversion 
COG ID[COG1042] Acyl-CoA synthetase (NDP forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.667038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.35114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCG AGCCTCCGGT CGTCTATCCG GCCGAGTGGG AGGCCGACGT CATCCTCAGT 
GACGGCGGCA CCGCCCACAT CCGCCCGATC CTGCCGACGG ACGGCCCGCT GTTGCGGACC
TTCTGGACCC GGCTGTCGAC CCAGTCGATC TACTTCCGGT TCTTCGCCGT GCGCCGTGCC
CTGAGCGACG CCGACATCCA CCGGATGACG ACCGTGGACC AGCGGCTGCG CGGCGCCATC
GTCGCGATGA TCGGCGACGA CCTGGTCGCG GTCTCCCACT GGGAGAGCAC CGCCGCACGG
CCGACCGAGG CGGAGGTCGC CTTCCTGGTG GAGGACGCCC AGCAGGGACG AGGGCTGGGT
TCGGTGCTGC TGGAGCATCT TGCCGCCGCG GCCTGGGACC GTGGGATCCG CCGCTTCGAC
GCCGACGTGC TCGGGGAGAA CCAGCAGATG ATCCGGGTCT TCCTCGATGC CGGTTACACG
GTCTCGAGGA CCTGGGACTC GGGCGCGGTC CGGCTGTCCT TCGAGATCAC CCCCACCCAG
AAGTCGGTCG GGGTGATGCG GGCCCGGGAG CACCATGCCG AGGCCGCCTC GATCGGTCGG
CTGCTGCATC CGCGCGCGAT CGCCGTCATC GGTGCGGGGC GGTCTGCCGC GTCCGTGGGC
AACGCGGTCC TGCGTCACCT GCTCGCCGGC GGTTTCGACG GCCCCGTCTA CCCGGTGAAC
CCGGCGGCGG CCGCGGCCGG CGGCGCGGTG GCGTCGATCC AGGCCTACGC CGGCATCGAG
GACGTGCCGC GTCCGGTCGA CCTGGCCGTG GTGTGCGTGC CGCCCGCGCA GGTGCCCGAC
GTCGTCGCCG CCTGCGGGCG GGCGGGAGTC TGGGGGCTGG TCGTCCTCAC CGACCAGCGC
GACGCCGCCG CAGACGCGGC CCTGGTGGCT GAGGCCCGGG CGGATGGCAT GCGGGTGGTC
GGCCCGGCGA GCATGGGCAT CCAGAATCCG GCGGCGGGGT TGAACGCCTC GATGGTGCCC
CGGATGCCCC CGGCCGGCCG CATCGGCTGC TACTCCCAGT CGGGTCCGTT CGGCGGGGCC
ATCCTGGCGG CGGCGGCCGC GCGCGGGGTC GGTCTGTCGG TCTTCGTCTC CGCCGGGGAC
CGGGCCGACG TGAGCGGCAA CGACCTGCTC CAGTACTGGG AGGAGGACCC CGAGACCGAC
GCGGTGATCA TGCATCTGGA GACCTTCGGT AACCCCCGCA AGTTCGCCCG GCTGGCCCGG
CGGGTGGGTC GGCGTAAACC AGTGATCGTC GTGTACTCCG GCCGTTCGAC CCTTGACGAC
GCGCTTCTGC GGCAGGCCGG GGTGATCGGC GTCGACCAGG TCTCCCAGGC GTTCGACGTC
GCGCTGCTGC TCACCACGCA GCCGCTGCCC GCGGGCGGTC GGGTCGCCGT CGTCGGCGAC
TCCCGGGCCC TGGTCCGGCT GACGGCCCGC GCCGCCGACG CGGCCGGGCT CAGAGTCGAG
GAGGTGCTCC TCCCGGTTGG CAGCTCCCCC GGGGACCTCA CCCGGGCGTT GACCTCCGCC
GCGGACCGGG CGGATGCGCT GATCGCCACG CTGGTCCGGC TGCCGCCCTC CCCGGCCGGG
CCGATCGCGG TGGACGCCGT CGCGGCTGCC GCGACCATCG AGATCCCGGT TCTCGCCGCC
GTGCAGGGGG TCGAGATGCC CGGCGAGCTC GCTGGCATCC CGGCCTATTC GTCGCCGGAG
GCTGCCGTCG CCGCGCTGCG CCGGGTGGTG GGCTACGCCC AGTGGTGGGC GCGCCCGATC
GGCACCGTGC CGACGACGAC GGTGCGAGCA GACGAGGCGC GTTCGCTCGT CGCCGGACTC
ACCGGTCGCC TTCCGGAGGA TCGGGCTGCC GCCCTGCTGG ACTGCTACGG CGTCACGGTG
GAGCCGGCCG TCCTGGTGAC GTCCCCGCGC CGGGCCGTCG AGGCGGCGGG CCAGCGCGGC
TATCCGGTGG CGCTCAAGGC GCGGTCGCGG CCCTACCGGC ATCGTCCGGA CCTGCGCGGT
CAGCGTCTCG ACCTGCCCGA CGCGGCAGCC GTGCGGGCGG CATGGGCGTC GCTGCGCTCC
CAGCTCGGGG CGGAGGTGCC GATTGTCGTC CAGCGGATGG CCCCGGTCGG CGTGTCGGTG
GTTGTCGGTT CCGAGGAGCA TCCCCGGTAC GGTCCCCTGG TCTCGTTCGG CCTCTCGGGC
CCCGCGACCG AGCTGCTGGA GGACCGCGCA CATCACATCC TTCCCCTCAC CGACGTTGAC
GCGGCGCGCC TCGTGCGCTC GGTGCGGGCG GCGCCGCTGC TGCTGGGATA CCTCGGCTCG
ACGCCGGTCG ACATCGCCGC GCTGGAGGAC CTGCTCCTCA AGATCGCCCG GCTCGCCGAC
GACGTGCCGG AGGTGGTTCA TCTCACGTTG GATCCAGTGA TCGTGTCGAC CGGGCGGGTG
ACCGTGCTGT CCGTGGAGAT CGTTGCTGGA CCGTCGGCGC CCCGCGCGGA CGTCGGACCT
CGGCGGTTCT GGACACCGAA CCACCCGACT CCGGACGGGG CGACTATCCG GGGTGGTTCA
GCACAACATG CGCACGCCGT CCACAATCGT CTCCCATGA
 
Protein sequence
MSTEPPVVYP AEWEADVILS DGGTAHIRPI LPTDGPLLRT FWTRLSTQSI YFRFFAVRRA 
LSDADIHRMT TVDQRLRGAI VAMIGDDLVA VSHWESTAAR PTEAEVAFLV EDAQQGRGLG
SVLLEHLAAA AWDRGIRRFD ADVLGENQQM IRVFLDAGYT VSRTWDSGAV RLSFEITPTQ
KSVGVMRARE HHAEAASIGR LLHPRAIAVI GAGRSAASVG NAVLRHLLAG GFDGPVYPVN
PAAAAAGGAV ASIQAYAGIE DVPRPVDLAV VCVPPAQVPD VVAACGRAGV WGLVVLTDQR
DAAADAALVA EARADGMRVV GPASMGIQNP AAGLNASMVP RMPPAGRIGC YSQSGPFGGA
ILAAAAARGV GLSVFVSAGD RADVSGNDLL QYWEEDPETD AVIMHLETFG NPRKFARLAR
RVGRRKPVIV VYSGRSTLDD ALLRQAGVIG VDQVSQAFDV ALLLTTQPLP AGGRVAVVGD
SRALVRLTAR AADAAGLRVE EVLLPVGSSP GDLTRALTSA ADRADALIAT LVRLPPSPAG
PIAVDAVAAA ATIEIPVLAA VQGVEMPGEL AGIPAYSSPE AAVAALRRVV GYAQWWARPI
GTVPTTTVRA DEARSLVAGL TGRLPEDRAA ALLDCYGVTV EPAVLVTSPR RAVEAAGQRG
YPVALKARSR PYRHRPDLRG QRLDLPDAAA VRAAWASLRS QLGAEVPIVV QRMAPVGVSV
VVGSEEHPRY GPLVSFGLSG PATELLEDRA HHILPLTDVD AARLVRSVRA APLLLGYLGS
TPVDIAALED LLLKIARLAD DVPEVVHLTL DPVIVSTGRV TVLSVEIVAG PSAPRADVGP
RRFWTPNHPT PDGATIRGGS AQHAHAVHNR LP