Gene Francci3_1929 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1929 
Symbol 
ID3904291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2266390 
End bp2268336 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content74% 
IMG OID637879266 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_481033 
Protein GI86740633 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.362101 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCCA TCCGGAGCGA GGTCACCTCG CCCACCTCCG CCATCTCGCC CATGTCTCCG 
ACTCTCCATC CGCGGCCGGG CCCCCCACGG TCGGCGACGA CCGGCGGGCC GACCCACGCG
ACCCACGGCA GCCACCCGAC CGACCCGACC GGCACGACCG ACGCTCGCGG GCACCATGAG
CCGCTCGACG CCGAGACGTT GCACGACGCA CTGGACGTCG CGGCCGCCCG GCATCCCGAC
ACGCCCGTCA CGTTCCCGTC CACCCACGAG GTGATCTCGC TGCGGGACCT CGCGGCCACC
TCCCGGATCA TCGCGGCCGC GCTGACGACG GACGGGGTCC AGGCCGGCGA GCGGGTTGGG
GTGCTGGCCG CGAACTCGGC CGAGTTTCTG CTCGCGCTCT TCGCGATCAG TCGGGCGGGC
GCGGCGGCCT GTCCGCTGCC ACTGCCGACC ACCGCGCACG ACCTCAACGG CTATGCCGAC
CGGGTCTCGC GCACTACGGC GGCCGCGGGC ATCCACCGGG TGGTCACCGG CGGCCGAGTG
TCGGCCATCC TGCGCCGCAC GGCCGACCGG CTCGGCGGAC TCCGGTTCCT CCCCGCGGCT
GACCTGGTCC GCGACGTCGG TCCCGCCACG CCCACACGCC AGCCACATCC GGGTCCCGAA
TGCCCGGTGG GTGCGGTGGG TGCGGTGGGT GCGGACGACG TCGCGATCGT GCAGTTCACC
TCCGGCAGCA CCGCGGCCCC GAAGGGCGTG GTGCTCAGCC ACCGCGCCGT GCTGTGCGGC
ATCCGCGCGA TCATCGACGG CATCCGGCTC GGTGAAGGCG ATCACGGCGG AATCTGGTTG
CCGCTGTTCC ACGACATGGG GCTGTTCGCG ACCCTCAGCG CGATCATGAC CGGCATCCCG
ATGACCGTCT GGTCCCCGGC GGACTTCGTC CGTGACCCCG CCGGCTGGCT GCGCTCGTTC
CTCGCCAGCG GCGCCACGAT CTCGCCGGCG CCGAACTTCG CCTACGACGA CCTCGTCCGG
GCGATCGACC CAGACGAGGT ACCCGGGCTG GACATGCGCC GGTGGCGCGT CGCGCTGAAC
GGAGCCGAGC CGGTCTCCGC GGTCGGCGTC GAGCGCTTCC TCGACCACTT CGCGCCGGCC
GGGTTCGCCC CCACGGCGAT GTTCCCCGTG TACGGGATGG CCGAGGCCAC CCTGGCCGTG
GCCTTCCCGC CGCTGGGCCG CGCCCCCGTG GTCACCTGGG TCGACCGGGA CCGACTCGCC
GCCGAGGGAA TCGTGACGGA GGTCCCGCGC GAGCATCCGC GCGCCAAGGG GCTGGTCGCC
GTCGGCCGGC CGGTCCGCGA CATCCGGATT CGCATCGCCG ACCTGCACGG TGACGTCCTG
GCCGATCCGC TCGGCGACGG TCTGGCGCCG AGCCTGCGGG TGGGCGAGAT CCAGATCCGG
GGTGGTTCGG TCACCTCGGG CTACCTCACC GCGGGGGGCC TGACGACCGG GGGCTTCACA
GCGGACGGCT GGCTGCGCAC CGGCGACCTC GGCTTCCAGC GCGGCGACGA TCTGTTCGTG
ACCGGCCGGG ACAAGGAAAT GATCATTATA CGCGGCGTGA ACTACTACCC CGAGGACGCC
GAAGCGGCGG TGCGTGACCT TCCGGGCGTG CACCGACGTC GCGTCGTCGC CTTCGGCTCC
CCCGACCAGC CGGACGCCGC CGGCAGCCCT CCGGACGCCA CCGGAGCGAA GGCTGCCGGC
GGCGTCACGG TGGTCGGCGA GACCGCGCTG ACCGAGCCCG CGGACCGAGC CCGACTCGCG
ACCGACCTGC GGATGGCGGC GACTGCGGCG CTGGGGCTGT CCACGGTGAC GGTGCGTCTG
GTCGAGCCCG GCGCGCTGCC CCGCACCTCC AGCGGCAAAT TCCAACGGCT GGCGGTACGC
GACCTGGTCG GGCGCGGCGT GCTCTGA
 
Protein sequence
MPAIRSEVTS PTSAISPMSP TLHPRPGPPR SATTGGPTHA THGSHPTDPT GTTDARGHHE 
PLDAETLHDA LDVAAARHPD TPVTFPSTHE VISLRDLAAT SRIIAAALTT DGVQAGERVG
VLAANSAEFL LALFAISRAG AAACPLPLPT TAHDLNGYAD RVSRTTAAAG IHRVVTGGRV
SAILRRTADR LGGLRFLPAA DLVRDVGPAT PTRQPHPGPE CPVGAVGAVG ADDVAIVQFT
SGSTAAPKGV VLSHRAVLCG IRAIIDGIRL GEGDHGGIWL PLFHDMGLFA TLSAIMTGIP
MTVWSPADFV RDPAGWLRSF LASGATISPA PNFAYDDLVR AIDPDEVPGL DMRRWRVALN
GAEPVSAVGV ERFLDHFAPA GFAPTAMFPV YGMAEATLAV AFPPLGRAPV VTWVDRDRLA
AEGIVTEVPR EHPRAKGLVA VGRPVRDIRI RIADLHGDVL ADPLGDGLAP SLRVGEIQIR
GGSVTSGYLT AGGLTTGGFT ADGWLRTGDL GFQRGDDLFV TGRDKEMIII RGVNYYPEDA
EAAVRDLPGV HRRRVVAFGS PDQPDAAGSP PDATGAKAAG GVTVVGETAL TEPADRARLA
TDLRMAATAA LGLSTVTVRL VEPGALPRTS SGKFQRLAVR DLVGRGVL