Gene Franean1_3057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3057 
Symbol 
ID5671436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3595869 
End bp3597494 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content65% 
IMG OID641241955 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001507375 
Protein GI158314867 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCC AACCGCTGAC CCTTGACGGC GTCGTACCCT ATCCGCCAAA GTCCGTGGCC 
CAGTACCTCA CAAAGGGATA CTGGCACGAT CAGACGCTTC CGGAGTTGCT TTTTGCCGCG
GCGGAACGCT ATCCAGACAA GCTCGCGGTC ATCGACCGCG ACACCCGGCT GAGCTATAGG
CAACTCACGG ATGAAGTCCT GCGTCTCGCG GCCGGTCTGC AGGATCTGGG GCTGGGCCGT
GGCGACCGAG TGGTCGTGCA TCTGCCCAAT ACCTACGAGT ACATCGCGTT CGTCTTCGCC
CTGTGGGAGC TCGGCGTGAT TCCCGTGGTC GCGCCCATCG CGCACCGTCG CGCGGAGATC
GAACACTTCA TCGAGATTGC CGAGGCGCGT ACCTACATCA CCGTCGCATC GGACAGCGGC
ACCGATCTCG CTGCGTTGGC TGCCGACCTG AAGGGTCGTT GGGCCCATCT GGAGCACACA
GTGATCCTCG ATCGCGGCGG CGGCGGCGCC GAGTACGACG CGCTGCTTTC GAAAGGATCG
CTCGAGCACG TGCGGCGATG CAGCCCGCAG GACGTCGCGC TGCTGCAGAT GTCGGGCGGC
ACTACGGGCG TTCCGAAGCT GATGCCGCAT ACTCATCACA CGTACGGTTA TGCGTTGCGC
AGAAGCGTCG GGGAACGGGG CATCACCGAG CGGACGGTGC ATCTGCTGGT CATGCCGATC
TGTCACAGTA TGTCGACGCG TTCGCCCGGC TTCCTCGGCG CGTTCAGCGT GGGCGCAACG
ATCGTGATCG CGCCGAACGG CAGCCCCGAT GCGGCGTTCC CGCTGATCGA GAAGCACCGC
GCGAACCGCG TGACGTTGGT GCCCCCGATC CTGCTGGCCT GGCTGAACTC CTCGCTGCGC
GACGCCTACG ATCTCTCCAG CATGCAGGTG ATCATGTGCG GGGGTGCGAA GCTGAGCGAG
GAGGTGGCGC GGCGGGTCGA GCCGGAGCTC GGCATGGAGC TGTCTCAGAG CTTCGGGATG
GGCGAAGGCC TGGTCGTCAG CAACCCTCCG GACGTCGATC GGGAGACCAG CGTGCGATAC
CAGGGCCGGC CGGCCTCGGA GGCCGACGAG ATCCGGGTCA TAGACGACGA GGGAAACGAC
GTGCCGCCCG GGGCGCCCGG TCACCTGCTG ACGCGGGGGC CGTCGGTAAT CCGCGGCTAC
TATCGCAACC CCGAGCAGAA TGCCCTGGCG TTCACCAGCG ATGGCTTCTA CCGCACCGGT
GACATAGTCG AGCGGGACGA GCGTGGGTTC ATGAGGGTCG TGGGCCGGTC GAAGGACCAG
ATCAACCGTG GCGGCGAGAA GATCGCGCCG GAGGAGCTGG AGAACGCGTT CCTCGCCCAC
GGCGGCGTGC ACAACGCCTC CGTGATCGGC ATCGATGACG AGGTGCTCGG CGAGCGCATC
AAGGCGTACC TGATCCCTCG GTCGCCGGAG GACGTGGCCG ATCTCACCTT GAGCAAGCTG
CGTCGGTTCC TGCAGGAGTG GGGCCTCGCG ACATTCAAGC TTCCCGATTT CGTGGAGGTG
GTCGACAAGT TCCCGTACAC CGCGGTCGGC AAGGTCAGCA AGCGACTGCA GCGCGAGCAG
AACTAG
 
Protein sequence
MSIQPLTLDG VVPYPPKSVA QYLTKGYWHD QTLPELLFAA AERYPDKLAV IDRDTRLSYR 
QLTDEVLRLA AGLQDLGLGR GDRVVVHLPN TYEYIAFVFA LWELGVIPVV APIAHRRAEI
EHFIEIAEAR TYITVASDSG TDLAALAADL KGRWAHLEHT VILDRGGGGA EYDALLSKGS
LEHVRRCSPQ DVALLQMSGG TTGVPKLMPH THHTYGYALR RSVGERGITE RTVHLLVMPI
CHSMSTRSPG FLGAFSVGAT IVIAPNGSPD AAFPLIEKHR ANRVTLVPPI LLAWLNSSLR
DAYDLSSMQV IMCGGAKLSE EVARRVEPEL GMELSQSFGM GEGLVVSNPP DVDRETSVRY
QGRPASEADE IRVIDDEGND VPPGAPGHLL TRGPSVIRGY YRNPEQNALA FTSDGFYRTG
DIVERDERGF MRVVGRSKDQ INRGGEKIAP EELENAFLAH GGVHNASVIG IDDEVLGERI
KAYLIPRSPE DVADLTLSKL RRFLQEWGLA TFKLPDFVEV VDKFPYTAVG KVSKRLQREQ
N