Gene Franean1_1903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1903 
Symbol 
ID5670304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2282727 
End bp2284526 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content73% 
IMG OID641240824 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001506246 
Protein GI158313738 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC TGCTTCCCGA CGTCCTGCGT GGGCGGGCCG CGCTCGAACC GGAGCGGACG 
GCGTACGTCT TCGTGAACGA GCGTGCGGAG GAGACCGGCC GCCTGACCTA CGGCGGGTTG
CACGCCCGCG CGCTGGCCGT GGCCGGTGAG CTGTCCCGGG CCTGCGAGCC CGGTGACCGG
GCGCTGCTGG TGTTCCCGCA GTGCCTGGAC TTCGTCGTCG CGTACCTCGG CTGTCTGTAC
GCGCGGGTGG TCGCGGTGCC CGTCCACCCG CCGCACCGGG ACGGTGTCCA GGACTCCACC
CGCCGGATCG TGGACGACTG CGAGCCGGCC GCGGTTCTCA CGCTCGAGGC GATGGCCCGC
GAGCTCCGGG CCGCCCTGAC CTCGCCGCGC GGCGCGATGT CCTGGATCCC CGTCGACCGG
ATCCCCGCCG GCCCGGCTCC CACGGGCATC CACACCGTGG ACGTCCACGC CGCGGGCCTC
GACCCCACGG ACATCGAGCC CATGGGCCTC GATCCCGCCG ACATCGCGTT CCTGCAGTAC
ACGTCGGGCT CGACGTCGGA CCCCAAGGGG GTCATGGTCT CCCACGGGAA CCTCGCCGCC
AACCAGGAGA TGATCCGGCG GGCCTTCGGG CACGACCGCG ACTCGACGGT GGTCGGCTGG
GCTCCGTTCT TCCACGACCA GGGCCTGATC GGCAACGTGC TGCAGCCGCT GTACGCCGGG
GCCACCTGCG TCCTGATGTC CCCGACCGCC TTCATCCGGT GGCCGATGCT GTGGCTGTCG
CTGATCTCCC AATACCGCGC GCACACCAGC GGCGGCCCGA ACTTCGCCTT CGAGGCCTGC
GTGGCCCGCG CCGCCCGGGG CGGGGTGCCC GATCTCGATC TCAGCTGCTG GAAGGTCGCC
TTCAACGGCG CGGAGCCCGT CCGCCCCGAC ACCCTGCGAC GCTTCGCCGA GACGTTCGCG
CCGCACGGGT TCGACGAGCG GGCGCTCTAC CCGTGCTACG GCCTCGCCGA GGCGACGCTG
CTGGTGACCG GCAGCGAGAA GGGCCGCGGG GCGCGGGTGA TCGAGGCGGC CACCGAGGAC
CTGCGTGCGG GCCGCTACAC GCCGGTGCCC GCGGGCCGCG GCAGCACCCT GGTCGGCTCC
GGGGTCCTCC CCCGGGACGG CGCGCTGCGG ATCGTGGACC CGCGGACGGG ACGCTCTCTC
CCGCCGGACC GGATCGGGGA GATCTGGGTC GCCGGGGACC ATGTCGCGCG GGGCTACTGG
AACCGCCCGT GGGAGAGCAC CGAGACCTTC CGCGCCGAGC ACGCCGACGA GCCCGGCCGC
GGCTACCTGC GCACGGGGGA CCTCGGCCTG GTGGTCGACG ACGAGCTGTT CGTCGTGGGA
CGGCTGAAGG ACCTCGTGAT CATCCGGGGC CGGAACTACT ACCCCCAGGA CCTCGAGCAC
ACCGCGCAGT CCGCGCATCC CGCGCTGCGG CCGGGCGGGT GTGCCGCGTT CTCCGTTCCG
GGTGCCGACC GGGAGAGGCT CGTCATCGTC CAGGAGGTCA GGGGCGAGTT CCGGCGGCGG
GCCGACCCGG GTGAGGTCGC CGGGGCCATC CGGGCCGCGG TGGTGCGTGA GCACCAGGTC
TCCGTCGGGG ATCTCGTGCT GACGCTGCCG GGCCGGCTCC AGAAGACGAC CAGCGGAAAG
ATCATGCGAG CCGCGGCCCG ACGCCGCTAT CTGCGAGACG CCTTCGACCG CTGGACGCCG
CCGACCGGCC CGTCCACCGG CCCGTTTCCC AGCACCGACA ACCCGCATGA GAAGACCTGA
 
Protein sequence
MTTLLPDVLR GRAALEPERT AYVFVNERAE ETGRLTYGGL HARALAVAGE LSRACEPGDR 
ALLVFPQCLD FVVAYLGCLY ARVVAVPVHP PHRDGVQDST RRIVDDCEPA AVLTLEAMAR
ELRAALTSPR GAMSWIPVDR IPAGPAPTGI HTVDVHAAGL DPTDIEPMGL DPADIAFLQY
TSGSTSDPKG VMVSHGNLAA NQEMIRRAFG HDRDSTVVGW APFFHDQGLI GNVLQPLYAG
ATCVLMSPTA FIRWPMLWLS LISQYRAHTS GGPNFAFEAC VARAARGGVP DLDLSCWKVA
FNGAEPVRPD TLRRFAETFA PHGFDERALY PCYGLAEATL LVTGSEKGRG ARVIEAATED
LRAGRYTPVP AGRGSTLVGS GVLPRDGALR IVDPRTGRSL PPDRIGEIWV AGDHVARGYW
NRPWESTETF RAEHADEPGR GYLRTGDLGL VVDDELFVVG RLKDLVIIRG RNYYPQDLEH
TAQSAHPALR PGGCAAFSVP GADRERLVIV QEVRGEFRRR ADPGEVAGAI RAAVVREHQV
SVGDLVLTLP GRLQKTTSGK IMRAAARRRY LRDAFDRWTP PTGPSTGPFP STDNPHEKT