Gene Franean1_5941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5941 
Symbol 
ID5674262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7238440 
End bp7240176 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content75% 
IMG OID641244789 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001510191 
Protein GI158317683 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0214212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCCC CGGCGCCGGC GCCGGTGGAC GAGGGCTGGA CGCGCTGGCC GGCGGAGCTC 
GAGCGGCGCT ACCGGGACGC GGGGTACTGG GGCGACCAGA CCCTGGGTGA CCTGCTGCGG
GAGTGGGCGC GCCGCTTCGG GCCGGCGACG GCGCTCGTCT CGGGACCCGA CCGGATCAGC
TACGCCGAGC TGGACGCGGC CGTGGACGAT CTCGCCGCCG GGCTGGCCAC CATCGGCATC
GGTCCCGGTG ACCACGTGGT GGTGCACCTG CCCAACCGCG CCGAGTTCGT GACCGTGCTG
TTCGCGCTGC TGCGCCGGGG CGCGATCGGC GTCCTCTCGC TGCCCGCCCA CCGCAGGGTC
GAGATCGAGC ACCTCGCGCG GCTCTCGGGC GCCGTCGGCT ATGTGATCGC CGACCGGCAC
GAGGGCTTCG ACTACCGGGA GCTGGCCCGG GAGGTCACCA AGGCGGTTCC CGCCGTGCGC
CACGTCCTGG TGGCCGGTGA GCCCGGCCCG TTCACCGCGC TGTCGGCGCT CGCCGCGGCC
GGGCGCGCCG CCCGGTCCGG CGGCGGCACA CGGACCGGGG CGGCCGGCGG TGCTGTCTCC
GTCACCGCGG CCGGCGCCGC TACCGACGAC GCCGGTAGCG GCGGGGCGGG CCCGGATCTC
GGCGCCGAGC GACCGGATCC GGGCGGTATC GCGGTGCTGC TGATCTCCGG CGGCACCACC
GGCAAGCCGA AGCTGATCCC GCGCACCCAT CGCGACTACG CCTACAACGC CCGCGCCAGC
GCGGACGTCT GCGGGCTGAC GGCGGACGAC GTGTACCTCG TCGCGCTGCC CGCCGCCCAC
AACTTCCCGC TGGCCTGCCC GGGGCTGCTC GGCGCGTTCG GCGTCGGGGC GACGGTCGTG
ATGGCTCCCT CCCCGAGTCC GGACGTCGCC TTCGACCTGG TCGCCCGCGA GCGGGTCACC
GTCACCGCGC TCGTCCCCCC GCTGGCCCGG CTGTGGGTCG AGGCGGCCGG GTGGGAGGGG
CCGGACACGA CGAGCCTGCG GCTGGTCCAG GTGGGCGGCG CCAAGCTCGA CGAGGGCCTG
GCGCGGCGGA TCACGCCGAC CCTCGGGGCG AAGGTCCAGC AGGTCTTCGG CATGGCCGAG
GGCCTGCTGA ACTACACCCG CCTCGACGAC CCGGACGAGC TCGTGTTCAC CACGCAGGGC
CGGCCGCTGG CCGCGGCCGA CGAGGTCCGG GTGCTCGACG GGTCCGGTGC CGGGGTGGTC
CCGGGCGAGG TGGGCGAGCT GTGGACCCGC GGGCCGTACA CGATCCGCGG GTACTACCGG
GCCGCCGAGC ACAACTGCGC CGTCTTCGAC GGCGACGGGT ACTTCCGCAC CGGCGACCTC
GTCCGCCAGC TCCCGACCGG CCACCTCGTG GTCGAGGGCC GGGTCAAGGA CGTGATCAAC
CGGGGTGGCG AGAACGTGTC CGCGGGTGAG CTGGAGGAGC ACCTGCTGAC CCATCCGGCG
ATCGGGCAGG TGGCGGTCGT CGGCATGCCG GACCCGGACG TGGGCGAGAG CGTCTGCGCG
GTGGTGGTCC TCGCGCCGGG CGAGGCCCTC CGGCTCAAGC AGGTCAAGAG CTACCTGCAG
GAGCGCGGCC TGGCCCGGTT CATGCTCCCC GACCGGCTCG AGGTCGTCGG CGAGTTCCCG
CTCACCGCCG TCGGAAAGAT CGACAAGCGG GAGCTGCGGA GCTGGCTGGA CCCCTGA
 
Protein sequence
MTAPAPAPVD EGWTRWPAEL ERRYRDAGYW GDQTLGDLLR EWARRFGPAT ALVSGPDRIS 
YAELDAAVDD LAAGLATIGI GPGDHVVVHL PNRAEFVTVL FALLRRGAIG VLSLPAHRRV
EIEHLARLSG AVGYVIADRH EGFDYRELAR EVTKAVPAVR HVLVAGEPGP FTALSALAAA
GRAARSGGGT RTGAAGGAVS VTAAGAATDD AGSGGAGPDL GAERPDPGGI AVLLISGGTT
GKPKLIPRTH RDYAYNARAS ADVCGLTADD VYLVALPAAH NFPLACPGLL GAFGVGATVV
MAPSPSPDVA FDLVARERVT VTALVPPLAR LWVEAAGWEG PDTTSLRLVQ VGGAKLDEGL
ARRITPTLGA KVQQVFGMAE GLLNYTRLDD PDELVFTTQG RPLAAADEVR VLDGSGAGVV
PGEVGELWTR GPYTIRGYYR AAEHNCAVFD GDGYFRTGDL VRQLPTGHLV VEGRVKDVIN
RGGENVSAGE LEEHLLTHPA IGQVAVVGMP DPDVGESVCA VVVLAPGEAL RLKQVKSYLQ
ERGLARFMLP DRLEVVGEFP LTAVGKIDKR ELRSWLDP