Gene Francci3_2406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2406 
Symbol 
ID3906389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2787547 
End bp2790240 
Gene Length2694 bp 
Protein Length897 aa 
Translation table11 
GC content71% 
IMG OID637879736 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_481502 
Protein GI86741102 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0350264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAAG GCCAGGTCAC CCAGGCGGTG CCGGTACCGC CGCCGCTGCG TCCCGACGAC 
CGGTTCGGAA ACGTCGTCGA CCGGATCCGC ACCGTCGCCA TGGTCTGCCA CGACGTCGTG
GCGGTCCGCG ACGAACACCG TGCCGTTACC TTCGCCCAGC TCGTCGCCTG GGTGGACGTC
GTCGCCGACC GCATCCTTCG GCAGCCCGCC GCGGCTGACC CGGACACGCC CGTCGCGGTG
CTGTTGCCCC ACGGCGCGAG CGGGATCGCC GCCGTCCTGG GCGTCATCGC GAGCGGCCGG
CCCTGCGTGC CCCTCGACAG GATGCATCCC ACCGACCGTC TCGCTCAGGT CGTGGGACTG
GCGGGGGCCT CGGTGTGCGT CACCGGACCG ACTGGCTCCG CCGACCAGCG GACTGCCGCC
GCGCTGCCGG GGATCGTGGA GACTATCGAC GTCGGCGACG AGCGCGTCGC CGACTGGTCC
CCGGCGCTGG CTGACCGGGT GGCGTCGACC GCGCCGCGCC GGGCCGATAC CGATCCAGCC
GTGCTCATCT TCACCTCCGG GTCGACCGGA GTACCGAAGG GTGTCGTGTG GCACCACCGC
GCCCTGCTGG GCATCCACTA CGCCATCCAG GTCCAGGACG TGATGCGGCT GGTTCCGGGC
GACCGCCTGC CGCTCTTCCT GCCCTACTCC TTCATCTCGG GGATGAACCG GACGGTCGGT
GGCCTGGTCT TCGGCACCAC CCTCGAGATG TACGACCCTC GGGTCCGCGG CGTCCGAGAC
CTGGCCGACT GGCTGCGCGC CACCCGGCCG GCCGGCATCG TGGCCACGCC GGCGCTGATC
CGGATCGTGT TCGGCTGCCT TGAGCCGGAC GAGGTGCTCG ACGACCTCCG ATTCGTCATG
TCGGTCGGCG AGGCGATCTA CGCCCGCGAC GTCGAGTTGG CCCGCCACCA CCTACCGCCG
GCCGCGGCGT TCCTGGTCTC CTACGGAGCC TCGGAGCTCG GCACGGCGAC CTGTGCCCCG
ATCTGGTCGG ACGACGACCT GCCCGACGGC GTGATGCCGG CCGGGCGGCC GGTGGTTGAC
GTCGCGGTGA GGGTGGTCTC GCCGGATGGC ATCGAGATGC CGCCCGGTGA GACGGGGGAG
ATCGTCGTGA TGGGCCACTT CATCACGGGG GGCTACTGGC GGGCGCCGGC GGCGAGCGCA
TCGCGGTTCG GCATCGGGCC GGACGGCACA CCCACCTACC GCACCGGAGA CCTCGGGCGC
ATCGACGCCA GCGGCCAGCT CAGGGTTGTC GGCCGCAACG ACGCGGCCGT GAAGATCCGT
GGGTACCTCG TCGAGCCGAT CGAAATCGAG TCCGCCCTGC TGGCGAGCTC GGATGTGCTA
GAAGCCGTCG TCGTGGCCGA CAGGACCACG CAGCGGGCCC GCCTGGTCGC CTACGTCGTC
CCCGTGACCG GCGCCCGGGT CTCACCGGCC TCGATCAGGC GCCTGCTACG GGCCAAGCTG
CCGTCCTACA TGGTGCCGGC CACCGTCATG CTGGTCACCG CGCTACCGAG GACCGACCGC
AGCAAGGTCG ACCGACTGAA CCTGCCCCCC GCCGGTCCGA AACCCGGCCA GGACCCGCCC
CGCGACCAGT GGGAAGAGGC CGTCGCCGGT GTGTGGGCCG CCGCCCTGCA CCTCGACGAC
GTCGGTATCC ACGACGACTT CGTCGAACTC GGCGGCGATT CCCTCATCGC CGAAGAGCTC
CTCACCAGAG TCGCCGACGA ACTCGGTGTC AAACTCCCCA CCTCCACCGT CGCCGACGCC
CCCACCGTCG CCGAGTTCAC CGCCCGCCTG CGCAATGCCG GCACCGACGT CCTACGCCAC
CCCACCGTCG TCCCGCTGCG CACCACCGGC AGCGGCGGGC CCCTCTTCTG TTTCTGCGGC
GCCGGCGGCC TCGCTGTCGG CATGCTTGGC TTTGCCCGCC ACTTCGACGG CGAACGCCCC
GTCTACGGCG TCCAAGCCCA CGGCCTCGAA TACCGCGGTC TCCCCGACTG GTCCATCTAC
GCCGCCGCCC GCCGTCACGC CCGCACCCTG CGCCTACTCC AACCCGCCGG CCCCTACTAC
CTCGCCGGCC ACTCCTTCGG CGGCCTCGTC GCCCTCGAAA CCGCGAGACT CCTCACCGAG
GCCGGCGAGC ACGTCGAACT GCTCGTCCTC ATCGACAGCT TCCTGCCCGA CACGTCCTCC
GGCGTCTTCG CGGGAGGAGT GCAGCCCTCT CAGCTTCCAG TTCCGGGGGG CCCGACCAGG
AGCTCGAATC CGGCCCCAGC CTTGGACCGT GCGAGGGGAT CGGCTATGGG CCCGGCGCGC
GCGGGGGCTG CCGCACGGGT CCTGCTCGGC TGGGCCCGGC AGGCGGCCCA ACTCCCGTTG
GCTGGCGTCG TTCAGTTCAA GGGCATGAAC CAGTACGACG TCTTCTACAA CCAGTCGCGG
GTGCTCACGA GGTTCTACCG GCCCAAGCCC TGGAACGGGC GGGCGCTGGT CTACCTGGCG
GCCGCCTCGC CGCCGCACCG TACGGAGGCC TGGCGCCCGC TGCTGACCGG CGAGACGACC
TACCGCACCG TGGGCGGCGA CCACGACACC GTGCTGCGCG AACCCGTCGT AAGCGAGATC
GCGGCCGACA TCCGGGCGGT CCTCGCTGGC TGTACCAGCT CCCGCAAGGC GTGA
 
Protein sequence
MRKGQVTQAV PVPPPLRPDD RFGNVVDRIR TVAMVCHDVV AVRDEHRAVT FAQLVAWVDV 
VADRILRQPA AADPDTPVAV LLPHGASGIA AVLGVIASGR PCVPLDRMHP TDRLAQVVGL
AGASVCVTGP TGSADQRTAA ALPGIVETID VGDERVADWS PALADRVAST APRRADTDPA
VLIFTSGSTG VPKGVVWHHR ALLGIHYAIQ VQDVMRLVPG DRLPLFLPYS FISGMNRTVG
GLVFGTTLEM YDPRVRGVRD LADWLRATRP AGIVATPALI RIVFGCLEPD EVLDDLRFVM
SVGEAIYARD VELARHHLPP AAAFLVSYGA SELGTATCAP IWSDDDLPDG VMPAGRPVVD
VAVRVVSPDG IEMPPGETGE IVVMGHFITG GYWRAPAASA SRFGIGPDGT PTYRTGDLGR
IDASGQLRVV GRNDAAVKIR GYLVEPIEIE SALLASSDVL EAVVVADRTT QRARLVAYVV
PVTGARVSPA SIRRLLRAKL PSYMVPATVM LVTALPRTDR SKVDRLNLPP AGPKPGQDPP
RDQWEEAVAG VWAAALHLDD VGIHDDFVEL GGDSLIAEEL LTRVADELGV KLPTSTVADA
PTVAEFTARL RNAGTDVLRH PTVVPLRTTG SGGPLFCFCG AGGLAVGMLG FARHFDGERP
VYGVQAHGLE YRGLPDWSIY AAARRHARTL RLLQPAGPYY LAGHSFGGLV ALETARLLTE
AGEHVELLVL IDSFLPDTSS GVFAGGVQPS QLPVPGGPTR SSNPAPALDR ARGSAMGPAR
AGAAARVLLG WARQAAQLPL AGVVQFKGMN QYDVFYNQSR VLTRFYRPKP WNGRALVYLA
AASPPHRTEA WRPLLTGETT YRTVGGDHDT VLREPVVSEI AADIRAVLAG CTSSRKA