Gene Francci3_1178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1178 
Symbol 
ID3905289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1406876 
End bp1409728 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content71% 
IMG OID637878510 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_480286 
Protein GI86739886 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0144635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTTTGA ACGAAAGCGG TCGGCGCGCG TCCCTTGCGG TCGCGACGCC TGAAGGTGGC 
GCGTTCTTCC CGGCGTTGCA CTTCGACGAC CTGCCGGCCT CGGTGGTCAG CCGGTTCCGG
GAGGTCGCCA CGCACCTGCC CGATACACCC GCGCTGGTCT CGCCCGGCGT CGTCATGACC
TTCGCCGAGG CTGATCGCCG GACCGATGAC ATCGCGATGG CGGTCCTCGG CCGCCTCGAC
GCGAACGAGG ACGGTCCGGT CGCGACCCTG CTGCCGCACA GCGTGGCCGG CCTGCTGGGC
GTACTGGGCG TACTGAAAAC CGGACGCCCG GTCGTCCCGC TGGATCCGAT GGTGCCCGCC
GAGCGGATGG CGCAGATCGT TCGGCAGGCC GGCTGCGTGG CCCTGCTGAC CGACCTGGGC
GACAGTTCCG TGGCCCGCTC GACGGTGGGC CTGCGGCCGG TGGGGACCAA CGGTTCGAAC
GCCACGGCCG AATCCAGCGT GAATCCGGAC GCCCTGCTCG CCGCGCTGGC CGGGGGCGGC
CCCCGCCACG TCCTGGACCT CGCCGCGGCG GCCGCGGACG GCGCCCGGTG GATCGCGGCA
AACGGCGCGG ACGCGGTCTG GTGGCCGCAG CCGCTGGTCG ACGATCCGGC CTGCATCGTG
TTCACCTCCG GTTCGACCGG CGCGCCTAAG GGCGTGGTGT GGACAAACGG CACGTTCCTG
TGCGACGCCT ACGCCGGCGC CGAACGTCTC GGTTTCGCGC CGGGCGACCG GCTGGCCCTC
GTGCTGCCGT ACTCGTTCGC CGCCGGCATC ACCGTGGTGG TGTTCGGTCT ACTCAACGGG
GCCGGGGTGT ATGCCTATGA TCCACGGGCG GCGGGCCTGA GCGGCCTCGC CGACTGGATC
TCCTCGCAGC ACCTGACCGC GCTGCAGACC ACTCCGTCTC TGCTGCGCTC CCTCGTCGGC
TCGCTCGAAC CGGACCAGGT CCTCGCCGAC CTGCGGATCG TGACCACGTG CGGGGAGGCC
GTCTACGGAC GCGACATCAC GGCGCTGCGG CCGCACGTGC CACCGGCGTG TACCTACGTG
AACTGGTCCG GTGCCTCCGA GATCGCATCC CTCGGCTTCT TCGAGGTCCC GCCGGGCACG
CAGCCGCCCG CGGGCACGGT ACCCGCCGGC CTGCCGGCGA CCGGCAAGGA GGTGGTACTG
CGTCGCGAGG ACGGCACGCC CGCCGATCCC GGCGAGGTAG GCGACGTGGA GGTCACCTCG
GCCTACCTGT CGGCCGGGTA CTGGGGGAAC GCGGAGATGA CGACGTCCCG CTTCACCCCG
CTCACCGATG GCCGGACCAC CTGCCGGACC GGCGACGTCG GCCGGTTCGA ACCCAACGGC
ACGCTGATCC TGCTCGGCCG CCGGGACGCG GCGGTGAAGA TCCGCGGCTA TCTCGTCGAA
CCGAGCGAGG TGGAGGCGGC GCTGCTGAGC TCCGCGGAGA TCGCCGAGGC GGTCGTCACC
GCCGTCGCGC ATCCTTCGGC GCGGAACCGG CTGGTCGCCT ATGTCGTGCC GGCGGTACAC
GGCAACACCC TGTCCCCGGT CCGAATCCGG CGGAGGCTAC GCGAGAAGCT GCCGGTCTGG
ATGGTCCCGA CGACGATCAT CCCCCTGGCG GAACTGCCCC GCAACGAACG CGGCAAGGTG
GACCGGGGCG CGCTGCCGCC CCCGCCGGAG GCCCCGGCCG TCTCCGCCCG GCCGAGAACG
CAGTGGGAGA TCGTCGTCGC CGACATCTGG ACGCGGGTCC TCGATCTCGA GGAGGTCGGC
ATCGGGGACG ACTTCATGGA GCTCGGCGGC GACTCCCTGG CCGCGAACGA GCTGCTCACC
CTGGTCGCGG AGAAGCTCGG GATCACCATG CCGTCATCGG CGCTGGTCGA CGCGCCGACG
CTGGGCGAGT TCGCCCGCGC GCTGTCACTC GCGCAGCAGT CGGGTCCGCG ACACCCCACC
GTCGTCCCCC TGCGCACCAC CGGCTCGCGG CCGCCGCTGT TCTGCTTCGC CGGGGCCGGC
GCGCTCGCCC TCGGCTTCCA CTCACTGGCC CGCCGCCTCG GCGACGACCA GCCGGTCTAC
GCCTTCCAGG CGCATGGGCT GGAGCGGCGG GGCGTCCCGG ACTGGAGCGT CGCGCGGACC
GCCCGCCGCC ACCTGGAGAT CATCCGCGTC CTGGCTCCCC GGGGTCCCTA CCTGCTCGCC
GGGCACTCGC TGGGTGGCCT GATCGCCATG GAGATCGCCC AGCAGCTCGC CGCCGCGGGA
GAGGAGGTCG GTTTCCTGTC CATCATGGAC ACCTACCTGC CGGCCTCGCT GCGGATCGAG
TCGGCCGGGT CGGGTAGCGC CGCCGGGTCG GGCGGCGCCG CCGGGTCGTC GAAGGCAGCC
GAGGACCACG CGCGCGGTTA CCGCGGCCGG CTGGCGCGGT TCACCCAGAA GCTGCTACCG
GAACAGCGGG CGAACTTCAC CAGCAAGGCG ACCCTCAAGA AAATGGTGCA GATTCCGCTC
ACCGGGGTCG TACAGTTCGG CGGGCTCGAG CAGTTCGACG TCTTCTTCAA CCACGGGCGG
CTTCTCGAAC GTTTCTACCG ACCGCGGCCA TGGCCCGGCC GGACCCTCGT GTACCGATCG
GCGGAGAATC CGGACCCGGA GGACGCATGG TCAGCATTCC TCACCGGGAG CCATGACACC
CATTTCGTTC CGTGTGAGCA TTTCTACCTG CTCCGCGAAC CGCACATCAT CAAGATCTCG
GAGCATTTCC GGGCCGAGAT CGACCGGGTG GTCGCCGGGC TGACGAAGAC AGGTCGCCGG
GCTGACGAAG GCAGGCTGAC CGCGACGGAC TGA
 
Protein sequence
MALNESGRRA SLAVATPEGG AFFPALHFDD LPASVVSRFR EVATHLPDTP ALVSPGVVMT 
FAEADRRTDD IAMAVLGRLD ANEDGPVATL LPHSVAGLLG VLGVLKTGRP VVPLDPMVPA
ERMAQIVRQA GCVALLTDLG DSSVARSTVG LRPVGTNGSN ATAESSVNPD ALLAALAGGG
PRHVLDLAAA AADGARWIAA NGADAVWWPQ PLVDDPACIV FTSGSTGAPK GVVWTNGTFL
CDAYAGAERL GFAPGDRLAL VLPYSFAAGI TVVVFGLLNG AGVYAYDPRA AGLSGLADWI
SSQHLTALQT TPSLLRSLVG SLEPDQVLAD LRIVTTCGEA VYGRDITALR PHVPPACTYV
NWSGASEIAS LGFFEVPPGT QPPAGTVPAG LPATGKEVVL RREDGTPADP GEVGDVEVTS
AYLSAGYWGN AEMTTSRFTP LTDGRTTCRT GDVGRFEPNG TLILLGRRDA AVKIRGYLVE
PSEVEAALLS SAEIAEAVVT AVAHPSARNR LVAYVVPAVH GNTLSPVRIR RRLREKLPVW
MVPTTIIPLA ELPRNERGKV DRGALPPPPE APAVSARPRT QWEIVVADIW TRVLDLEEVG
IGDDFMELGG DSLAANELLT LVAEKLGITM PSSALVDAPT LGEFARALSL AQQSGPRHPT
VVPLRTTGSR PPLFCFAGAG ALALGFHSLA RRLGDDQPVY AFQAHGLERR GVPDWSVART
ARRHLEIIRV LAPRGPYLLA GHSLGGLIAM EIAQQLAAAG EEVGFLSIMD TYLPASLRIE
SAGSGSAAGS GGAAGSSKAA EDHARGYRGR LARFTQKLLP EQRANFTSKA TLKKMVQIPL
TGVVQFGGLE QFDVFFNHGR LLERFYRPRP WPGRTLVYRS AENPDPEDAW SAFLTGSHDT
HFVPCEHFYL LREPHIIKIS EHFRAEIDRV VAGLTKTGRR ADEGRLTATD