Gene Franean1_2704 details

Gene Information

Locus tag: Franean1_2704
Symbol:
ID: 5671095
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3199252
End bp: 3200889
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 71%
IMG OID: 641241616
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001507036
Protein GI: 158314528
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
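
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3199252, 3200889)
print(length, protein_length_aa(length))  # prints: 1638 545
```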


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGACA AGATCGAGAA AGAACCAACC CAGGCCGAGC AGGCCGTTAC CCGGAGGCGA 
CCGCTCTTCC CGCGGGCCTC CCCGGAGCGC GTGGCGCGCT ACCGGGCGGA GCGACTGTGG
GACGACCGGG GTCTGGCGGA CGGTGTCGAG GCGGCAGCCG TCCGACGGCC GGACGCGCCG
GCGATCGTGG ACAACGATCG GCGGCTCACC TATGCCGAGC TGAGCGGGGC CGTGGCCAGC
GGGGTGGCGG CTCTGGCCGC ACGGGATGTG CGGGCCGGCG ACGGCGTGGT CCTCATCAGC
GGCAACACCC GCCACGGAGT GATCGCTTAC CATGCCCTGC TGCGCACCGG TGTCACGGTG
CTGGTGCTGG ATCGGCGCTG CGGTGTCGCG GACATACTGT TCGCCCTGGA CGCGCTCCCC
GGTCGGGCCC GCGTGATCGT CCCCGCCGGG GAGAAAAACC GCCTCGACGA GGCACTGACC
GCCGCCGAGG TTCTGCCGCT CGAACTGTTC GACGTCCAGC CGGCGCCCCT GGCCCCGCCG
ACACGGACAC CGGCGGCGTG GGCCGAACCG GACCGCGACC GTGCCGCGGT GATCCTGTTC
AGCTCGGGAA CCACCGGCAG GCCCAAGGGC GTCGTCCACT CGCTCAACAC GCTGACCGCC
GGCACCGCCA ACATGGCGCG CGTCACCTCG ACCGACCTGA GCTCGGTGGT CTTCCTCGTC
AGCCCGCTGA CCAGCATCAC CGGCCTGATG CAGATCCAGC TCGCAGCCGA TCAGCACGGC
ACGCTCGTTC TGGAGGACCG TTTCCAGCCC GAGCAGACAC TGCAACGGAT GAACGCGGTG
GGCGCGACCC TGTTGGGCGG CGCACCGGTC ATCGCCGAGC GGCTGCTGGC CGCCGCGACA
TCCGCGGGAC CGGGCACCGG CGTCAGCCTG CGGACACTCG CACTCGGCGG CGCGATGCTG
CCGCGCCCGC TGCTCGAGCT GGCCACGGAC ACGTTCGGGA TCGAGATCGC CCGGGTGTAC
GGCTCATCCG AGGCGCCCAT ATTCTCGGGG AGTCTGCCGC TCGACGAGCG TGAGCGACGG
CTGTCCGACG ACGGCGCGCT CATGCCCGGT GGCGAGATGC GTGCCGGCTC CACCGCTCAC
CCGCGGGAAG GCCTCCTGCG AGGGCCGAGC GTCTTCCTGG GATATCTGGA CCCGGCGGAC
GACGAGGCCG CGTTCGAGGA CGGCTGGTAC CGCAGCGGTG ATCAGATCGA GGTGCACCAG
GGCAGGCTGA CCGTCGTCGG GAGGATCAAG GAGATCGTCA ACCGCAACGG CCTCAAGATC
TCGCCGAGCG AGATCGACAC CGCCCTGGCG GGGTTGCCGG GGGTGCTTGA ACACGCCTCG
TTCGGGCTCC CCGACCCATC GACCGGCGAA CGGCTCGCGG TCGCGGTCGC GGTCGCGGTC
GGCAGCATCG TCACGCTCGA CGACGTCGTG GCGCATCTCC TCACCCGGGG GATAGCCAAG
CGCAAGCTGC CGGAGCAGCT CGTGCGCTGG GACGGCCCAC TCCCCCGCAC CATCTCCGGG
AAGGTCGTCC GATCCCGGCT CGTCATGGAG TCACCGGCGA AGGACAGCGA CCTGGCAGTG
CGGCTGCGGG AGCACTGA
 
Protein sequence
MPDKIEKEPT QAEQAVTRRR PLFPRASPER VARYRAERLW DDRGLADGVE AAAVRRPDAP 
AIVDNDRRLT YAELSGAVAS GVAALAARDV RAGDGVVLIS GNTRHGVIAY HALLRTGVTV
LVLDRRCGVA DILFALDALP GRARVIVPAG EKNRLDEALT AAEVLPLELF DVQPAPLAPP
TRTPAAWAEP DRDRAAVILF SSGTTGRPKG VVHSLNTLTA GTANMARVTS TDLSSVVFLV
SPLTSITGLM QIQLAADQHG TLVLEDRFQP EQTLQRMNAV GATLLGGAPV IAERLLAAAT
SAGPGTGVSL RTLALGGAML PRPLLELATD TFGIEIARVY GSSEAPIFSG SLPLDERERR
LSDDGALMPG GEMRAGSTAH PREGLLRGPS VFLGYLDPAD DEAAFEDGWY RSGDQIEVHQ
GRLTVVGRIK EIVNRNGLKI SPSEIDTALA GLPGVLEHAS FGLPDPSTGE RLAVAVAVAV
GSIVTLDDVV AHLLTRGIAK RKLPEQLVRW DGPLPRTISG KVVRSRLVME SPAKDSDLAV
RLREH
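
The gene and protein sequences above are related through translation table 11, which can be illustrated with a short sketch. It assumes the table 11 codon assignments match the standard code and that an alternative start codon such as GTG is read as methionine at the first position; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and written only for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 shares these assignments
# with the standard code and differs in its set of start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGCCCGACA AGATCGAGAA AGAACCAACC"))  # prints: MPDKIEKEPT
```

Passing the full gene sequence to `gc_fraction` and `translate` in the same way should allow the GC content and protein sequence fields to be cross-checked against the record.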