Gene Francci3_1806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1806 
Symbol 
ID3904036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2141065 
End bp2142645 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content72% 
IMG OID637879144 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_480911 
Protein GI86740511 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACCC CGTGGACCCT CGCGGCTCTC TGGCAGCGCA TCGCCGCGCA GCAGCCCCAC 
CAGACCGCGC TGATCCACGG CGATCAGGCG TGGACGTGGG CGCAGTTCGA CGCGGCATCA
GCGGCCCTCG CCCGCACCCT GCGGCGGCAC GGAGTGCAGG CCGGCCAGGT CGTCGCCCTC
TGCCTGCCGA ACATCCCCGA ACACCTTGTC AGCCTCGCTG CCGTGCTCCG TCTCGGCGCA
ACGCCCGCGC AGCTCAACCC CCGCTACCGT GCCCGCGAAC TCGACCAGCT TCACCGCCTG
CTCCAGCCCG CCGCAATGAT CGCCGATCCG GTCCAGGTCC CAGACGTCGC GACGCTCCAC
GCCCGCTGCC AACCTGGCCA GGAAGTACCG ATCCGCCCCC TCGGAGGCCA CGGGCTGCTC
CTGACCTCCG CGGCCCCGCG GGCATCCTCG TCGCCGCCGG CGAATGGGCC GCGACCGCCT
TTGATGATCA AATGTACGGG TGGGACCACC GGCACACCCC AGGCCGTGCT CTGGCGCGTC
GCCGACATCC TGGACAACCT CAACGCCCAC AACCCCTGGG CCCGCCACGA CCTGTCCCGC
CCTCTCACCC GGATGGCGTC CCTGACGCTG GCCGACGCGC GCATCGTCGT CGCCAGTCCC
CTCAGCCACG GCTCGGGTCT GACGCGCGCG ATGGGCGCGC TCTGCGCCGG CGGCACAGTG
ATCACCCTGC CAGGGTCGTC CTACGACCCC GACAGGGTGC TCGACACGGT CGTGCAGCAG
CGCGCCGACA CCCTGGCGAT CGTCGGCGAC GCCTACGCAC GACCGCTCAC GAGCGCGCTT
GGCGCCCGCC TCGGCGCTGA CCTGTCCGCT TTGCGGACGG TCACCTCGTC GGGCGCGCCG
TGGACGGACC AGGTCAAGAC CGAACTCCTC GCCCTGGTCC CACACCTGCG GCTGGTCGAA
ACCCTCGGCG CCACCGAGGC CACCGGCCTG GGATCATCAC TCGCCCGCCT CGGAGACGTA
CCTGCGACCG GGTCCTTCGA CCTCGGCCGA CACGCCCGGG TGTTCCACGC TGATGGAACC
CCGACCGCGG TCGGTGAGAC GGGACAGGTC GCGGTCCACC GGCCGCTGCC CGTCGGTCTG
CACCCCCACG GCACGCTTCC ACCCCACCGC TACGTCCGCG CCTATGACGG CCGCACCTAC
CTCCTGTCCG GAGACCTCGT CCGGCTTCAG ACGAGTCGGA GGATCGCGCT GCTCGGCCGC
GAGCAGGACT GCATCAACAC CGGCGGGGAG AAGGTGTACG CCCCCGATGT CGCCGCCGTC
CTCCTGGCCC ACCCCCACGT CGCCGACGCC GCCATCCTCG CCGTCCCCGA CACGCGGCTC
GGGAGCACCG TCGGCGGCCT CCTCCAGCTC CACGCCGGCG GCAGACTCGC ACAGGTACTG
GGTGACATCC GTGGCGACCT CGCCGGCTAC AAGATCCCAC GGGTGGTGCG GGTCGTCGCC
GCCATACCCA GGACCCCGGC CGGGAAGGTC GACCTCGTCC GCGCCCGCCA GCTCCTCAGC
GACCAGGAGG CCAGTTCATG A
 
Protein sequence
MTTPWTLAAL WQRIAAQQPH QTALIHGDQA WTWAQFDAAS AALARTLRRH GVQAGQVVAL 
CLPNIPEHLV SLAAVLRLGA TPAQLNPRYR ARELDQLHRL LQPAAMIADP VQVPDVATLH
ARCQPGQEVP IRPLGGHGLL LTSAAPRASS SPPANGPRPP LMIKCTGGTT GTPQAVLWRV
ADILDNLNAH NPWARHDLSR PLTRMASLTL ADARIVVASP LSHGSGLTRA MGALCAGGTV
ITLPGSSYDP DRVLDTVVQQ RADTLAIVGD AYARPLTSAL GARLGADLSA LRTVTSSGAP
WTDQVKTELL ALVPHLRLVE TLGATEATGL GSSLARLGDV PATGSFDLGR HARVFHADGT
PTAVGETGQV AVHRPLPVGL HPHGTLPPHR YVRAYDGRTY LLSGDLVRLQ TSRRIALLGR
EQDCINTGGE KVYAPDVAAV LLAHPHVADA AILAVPDTRL GSTVGGLLQL HAGGRLAQVL
GDIRGDLAGY KIPRVVRVVA AIPRTPAGKV DLVRARQLLS DQEASS