Gene Francci3_2046 details


Gene Information

Locus tag: Francci3_2046
Symbol:
ID: 3904619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2407411
End bp: 2408898
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 67%
IMG OID: 637879383
Product: AMP-dependent synthetase and ligase
Protein accession: YP_481149
Protein GI: 86740749
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
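
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python. It assumes (the page does not state this) that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2407411
end_bp = 2408898
reported_gene_length = 1488      # bp
reported_protein_length = 495    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print("gene length matches:", gene_length == reported_gene_length)
print("length is a whole number of codons:", gene_length % 3 == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions, the span between the two coordinates corresponds to 496 codons, one of which is the stop codon.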


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.920452
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.502259
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGCTTA CCCGCACCCT GAGGCGCTAC GCCAAGGAGA TTCCAGACGC CCTGGCGCTG 
GCCGATGCCA CCAGGGAGCT CACCTGGCGC GACTTGAGCA TCGAGGTGGA CCGGCTCGGC
GCCTTCATCT GCGCTTCGAC CGACCGCGGC GCCCGCGTGG CGTTCCTCAG CCACAGCCGC
GCGGAACACT TCGTGCTGCT CTTCGCATGC GCGATGAACG GGCGGACCTT CGTTCCCCTG
AATCCGAACC TGACCACGCC CGAGCTTGTT CACCAGGTCA GTCTGGTGAC GCCCTCGCTG
GTCTTCCACG AGGCGGCCAC CGACAAGTCC GCGGCGCTTC TGGTGGACAA ACTGGACTGG
GTTCGCGCCC GGGACGTCGA TGACGTCCCC GACGCCCGGC CGACTGCCCC GGCCCTGCTG
CGGCTGGAGG ATCCCGCCGT CATCTTTTTC ACCTCGGCGA CGACCGGGCG GCCGAAGGGC
GTGCAGGTCC CTGAGCGTTC CCTGCGGGCG AACTCGGTCG GATGGCAGGG CGATGTTCTC
GAGAGGTACC CGGACGCACG TTTCCTCAGC GCCTGCCCGC TTTACCACGG CAGCTCCGTC
ATAGCGCTCG ACTACCTCAG CAACGGACGC CCCGTCCATA TCATGCGGAG CTTCAACCCT
CGTTCATGGC TTCGCGCCGT CAAGAGGAAC CAGATCAGCC ACAGCTTTCT CGTCCCATCG
ATGATCACGC TTCTCATGAA GGTGTCGCAG CTGGACCGGT CGGAGACCGA GTCGCTCGTG
CTGCTGGCCC ATGGCGCCGC TCCTATGCCC TCGAAGCTTG CCGAGGAGGC GCGCGACCGG
CTGGGGGTCG ATCTGTTCAG TGTCTACGGC ATAACCGAAG GCGGCGGACC AGCAATCGTG
GGCACGCTCC CACCGAGCCT CATCGGGCCG TTCCCCGGCG CCACATATCT TGGCTTTCCG
CTGAAGGGGA TGATCGCGCG GGTCCTCGAT GACGAGGGAC GACCGGCGCC GCCAGGCCAC
GCGGGCGAGA TCGCCCTGCG CGGAGACGGT CTGATGACGC AGTACTGGCA TGACCCAGGA
GCCACCACCC AGTCGATCGT CGACGGCTGG CTCCGCACGC ACGACGTCGG GGTCCAGGAT
GACGAGGGCG TCTACTGGAT CCTCGACCGG CGCACTGATC TGATCATTCG TGGCGGTCAG
AACGTCTACC CCGCCGAAGT AGAAGCAGTC GTTCGCACGG CGCCAGGAGT GCGGGACGCC
GCGGTCGTCG CGGCACCGTC GACCATTTGG GGCCAGACAC CGGTCGCGTA CGTCGTCCCC
ACCGAGCAGG GCTCCACGAG CGAGGCCGAC ATCGTCGGCT GGTGCGCGGG CCGGCTCGCC
AGCTACAAGA CCCCGACGCA GGTTATCTTC ATCCCCGAGT TGCCCGTCGG GCCCTCCGGA
AAGGTCTTAC GTCGTGCGCT ACGAAAGTTC GAGAACGGTG TCCGCTGA
 
Protein sequence
MWLTRTLRRY AKEIPDALAL ADATRELTWR DLSIEVDRLG AFICASTDRG ARVAFLSHSR 
AEHFVLLFAC AMNGRTFVPL NPNLTTPELV HQVSLVTPSL VFHEAATDKS AALLVDKLDW
VRARDVDDVP DARPTAPALL RLEDPAVIFF TSATTGRPKG VQVPERSLRA NSVGWQGDVL
ERYPDARFLS ACPLYHGSSV IALDYLSNGR PVHIMRSFNP RSWLRAVKRN QISHSFLVPS
MITLLMKVSQ LDRSETESLV LLAHGAAPMP SKLAEEARDR LGVDLFSVYG ITEGGGPAIV
GTLPPSLIGP FPGATYLGFP LKGMIARVLD DEGRPAPPGH AGEIALRGDG LMTQYWHDPG
ATTQSIVDGW LRTHDVGVQD DEGVYWILDR RTDLIIRGGQ NVYPAEVEAV VRTAPGVRDA
AVVAAPSTIW GQTPVAYVVP TEQGSTSEAD IVGWCAGRLA SYKTPTQVIF IPELPVGPSG
KVLRRALRKF ENGVR
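
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any computation. As a rough sketch, the Python below defines three hypothetical helpers (`clean_sequence`, `gc_fraction`, `translate`) that strip the spacing, compute the GC fraction, and translate codons up to the first stop. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, not in its amino acid assignments). Only the first line of the gene sequence is embedded here; paste the full sequence in its place to compare against the reported GC content and protein sequence.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed on the page.
first_line = "ATGTGGCTTA CCCGCACCCT GAGGCGCTAC GCCAAGGAGA TTCCAGACGC CCTGGCGCTG"
seq = clean_sequence(first_line)

print(len(seq))
print(round(gc_fraction(seq), 2))
print(translate(seq))
```

Translating this first line gives the first twenty residues of the protein sequence listed above, which is a quick way to confirm that the reading frame starts at the first base.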