Gene Francci3_2245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2245 
Symbol 
ID3905013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2618950 
End bp2620509 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content68% 
IMG OID637879576 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_481342 
Protein GI86740942 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00923204 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0183444 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATC TCACCGGGAA AAGTGACCTG ACCCTGAAGG CGGTGTTCGT CGACGCGTTG 
GACCGGTTCG GTGCGCGTCC CGCCCTGCAC TATCAGGGGC GGACCTACGG GTACGGCGAG
ATCGTGGCCG CGGCGAACCA GCTCGCGCAC CGGCTGCGTG CGGCGGGGGT GGGGCCGGGG
GTGTCGGTGG CGTTGATGAT GTCCAACCGG CCCGAGTACA TCGTCGCGGA TCAGGCGATC
CTGCGGTGTG GCGCGGTCAA GGTGGCGCTC AACGACATGC TGTCGGCCAG CGAGATCGAC
TACATTCTGC GGGACAGCGA GGCCCGGGTC GTCCTCGCCG ATGCGGGGAT GCTCCCGGCT
GCGCTGCACT CCGCGCCGCC CCTGTTGGAG ACGGTCATCG CCGTCGCCGA CCCGGACGAC
TGCCCGGGCG GGGTGGTGGC GTGGCACGAC GCGCTGGCCG GGCAGCCGAC CACCGTGCCG
GAGGTCGACC CGACACCCAC CGACCCGGGG TTGATCGTCT ATACCGGGGG TACGACCGGT
CTGCCCAAAG GGGTGATGCA CACCCAGCGG AATCTCGCGC TCAATCTGTT CTCGCACGTG
ATGGAGATGG GGCTGCTCGA CGACGAGGTG CTGCTGTTGA TGTCGCCGCT GCCGCACAGC
GCGGGTTTCC TGCTGCAGGC CGGGATGCTC AAGGGGGCCC GGCACTTCCT GGAGACCAGG
TTCGACCCGG AGCTGGTGCT TGAGCGGATC ACCGCCGACC GGGTGACCTT CACGTTCATG
GTGCCTACCA TGATCTACCG GGTGCTTGAC CGGGCGGCGG GCCGCGCGTT GGACCTCAGC
TCGCTGCGGA CCATCCTGTA CGGTGCCGCG CCGATCACCC GGGAGCGGCT GGAGCAGGGC
CTGGAGGTGC TCGGCCCGGT GTTCATGCAG CTGTACGGGC AGTCGGAGGC GCCGAACTTC
ATCACCCGTC TTCGCCGTGA GGATCATCGT CTTGATCCTG ACGGGGAGCA TCGGCTGGCC
AGCTGTGGTC AGCCGGTCGT CATGGCCACG GTCAGGGTGG TCGACGAGGC TGGCCGGGAG
CTGCCCCGCG GTCAGGTCGG GGAGATCGTC GCCGCCACGC CGTACACGAT GGTGGGGTAT
CGGGGCCGGC CCGAGCAGAC CGCCAAGGCG CTGCGGGACG GGTGGTTGCA TACTGGGGAT
ATCGGGCGGA TGGATGCCGA GGGGTACGTC TATCTGCTGG ACCGCAAGAA CGATATGATC
ATCACCGGTG GGATGAACGT GTACAGCACG GAGGTGGAGA ACGCGGCGGC GGCCTGTCCT
GGGGTTGGGC AGGTCGCGGT CGTCGGGGTG CCGCATCCGG ACTGGGGTGA GGCGGTCGTG
GCGTTCGTCG TGCCCGATGA TACCGGTGCG TTCGACGAGG CCAAGCTGCT GGCGCACTGT
CGGGTCGAGC TTGCCCGGTA CAAGCAGCCC AAGGCCGTGC GGGTCGTCGA GGCCCTGCCG
ACCACCGTGT ACGGCAAGCT GGACAAGAAG GCGCTGCGGG CCGGCTGGCC CGGTTGGTGA
 
Protein sequence
MSDLTGKSDL TLKAVFVDAL DRFGARPALH YQGRTYGYGE IVAAANQLAH RLRAAGVGPG 
VSVALMMSNR PEYIVADQAI LRCGAVKVAL NDMLSASEID YILRDSEARV VLADAGMLPA
ALHSAPPLLE TVIAVADPDD CPGGVVAWHD ALAGQPTTVP EVDPTPTDPG LIVYTGGTTG
LPKGVMHTQR NLALNLFSHV MEMGLLDDEV LLLMSPLPHS AGFLLQAGML KGARHFLETR
FDPELVLERI TADRVTFTFM VPTMIYRVLD RAAGRALDLS SLRTILYGAA PITRERLEQG
LEVLGPVFMQ LYGQSEAPNF ITRLRREDHR LDPDGEHRLA SCGQPVVMAT VRVVDEAGRE
LPRGQVGEIV AATPYTMVGY RGRPEQTAKA LRDGWLHTGD IGRMDAEGYV YLLDRKNDMI
ITGGMNVYST EVENAAAACP GVGQVAVVGV PHPDWGEAVV AFVVPDDTGA FDEAKLLAHC
RVELARYKQP KAVRVVEALP TTVYGKLDKK ALRAGWPGW