Gene Franean1_4754 details


Gene Information

Locus tag: Franean1_4754
Symbol:
ID: 5673096
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5677533
End bp: 5679254
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 72%
IMG OID: 641243611
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001509027
Protein GI: 158316519
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
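
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 5677533
end_bp = 5679254
gene_length_bp = 1722
protein_length_aa = 573

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon after the last residue.
expected_cds_bp = (protein_length_aa + 1) * 3

print(span_bp, expected_cds_bp)
```

Under those assumptions both values come out to 1722, which agrees with the reported gene length.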


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.548745
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACCG ATTCCGCGAG CGAGGACGTC CTCTCGACCG CTCGGCCGTG GCTCGCCGCG 
TACCCACCGC AGATCGACCC GTCGCCGCGC CTCGCCCACT CCTCGATCGT CGACGCGTGG
CGGGACCGGG TGGCCGGCAA CCCCGGCCGG ACGGCGGTGC GGTACTTCGA CGGCTCCCTC
AGTGCGGCCG AACTCGACGC CCACACCGAC GCCCTCGCCG CCGAGCTGCA GGAACGGGAC
GTCCGGCCGG GCGAACGCGT GGGCGTCTAC CTGCAGAACG TGCCGCACTA CCCGATCAGC
CTGCTGGCGA TCTGGAAGGC CGGCGGCATC GCGGTGCCGC TCAACCCGAT GTACCGCGGG
CGCGAGCTGC GGCGCCTGAT CGACGACTCC GGCACCACGG GGATCATCCT GGCCCGCGGG
ACCGACGCCC AGACCCGCGA GACGCTCGCG GGCAGCACCG TCCGCTGGCT GTTATCCGCC
TCCGCGCGCG ACTTCCAGAC GGCGGACGAC CCGCGCGTCT TCGGCGCCGC CGGCGATCCC
GCGCCGTCGC CCGACGGCGA TCTCGCGGCG ATCCTGTCGG AACGCCGGGG GCAGCACCCC
GAGCCGGTCG AGACCGCCCT TGACGACACC GCCTTCCTCA CCTACACCTC AGGCACCACG
GGCCCACCCA AAGGCGCGCT GAACACACAC CGGAACTGCC TCAACTCGGT CCTGAACTAC
GGCCGCTGGC TACTCCTCCA GCCTGGCGAC GTGGTGTTCG CGATCGCCCC CCTCTTCCAC
ATCACTGGGC TCTCCCTGAA CGCGGGCATC GCACTGCTGA ACGACACGAC ACTGAGCATG
AGCGGCCGCT TCGAGCCCTC AGTCGTGCTC GAGGCCTTCC GTGACCACGG CGTGACGACC
ACCATCGGTT CGATCACGGC GTTCAACGCC TTTTTCCGGG TGGACGGCGC GGGCCCCGAG
CATTTCGCCG CGGTGAAGCG GCTCTATTCC GGCGGCGCGC CCATCCCGCC GTCGACGGTG
GAGGCCTTCC GGTCGAGGTT CGGGCCCTAC CTGCACAACA TCTGGGGCAT GACCGAGACG
ACCGGCGGTG GCATCGCCGT GCCCCCCGGC GCGGCGGCAC CCGTGCACGG CCCGAGCGGC
ACCCTGTCGA TCGGTGTGCC GATGCAGAAC GTCGACGTGT GGATCACCGA CGAGAGCGGC
GCGCCGCGGC CGCCGGGCGT CGAGGGTGAG CTGGTCATCT CCGCGCCGCA GGTCATCCCC
GGCTACTGGC GCAACCCCGA AGCGTCGGCG CACGCGCTGG CGGGCGGGCG ACTGCGCACC
GGGGATGTCG CCGTCCTCGA CGCCGCCGGC TGGGTCTATC TCGTCGACCG GGTCAAGGAC
CAGATCAACA CGTCGGGCTT CAAGGTCTGG CCGCGCGAGG TCGAGGACGT CCTGTACGAG
CACCCCGACG TCTTCGAGGC CGCCGTGGTC GGCCTGCCGG ACGCCTACCG CGGCGAGACC
GTGGCGGCCT ACGTGTCGCT GCGCGACGGC GCGGCCACCA CTCCCGAGGA GCTGACCGCC
TTCGCCCGGG AGCGGCTCGC CGCGTACAAG TACCCGCGCC GGATCTCGAT TCTTCCGGAG
CTGCCGAAGA CCGCCACCGG CAAGATCCAG CGGGCGGTGC TGCGCGAACA GGCCCCCGCG
GGACAGGTGA TCCCTGCCCT GCCGCGGGCA GAACCCGGCT GA
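
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation such as GC content. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above (six blocks of ten bases).
first_row = "ATGCCGACCG ATTCCGCGAG CGAGGACGTC CTCTCGACCG CTCGGCCGTG GCTCGCCGCG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full 1722 bp sequence through the same two functions would give a figure to compare with the GC content field in the record.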
 
Protein sequence
MPTDSASEDV LSTARPWLAA YPPQIDPSPR LAHSSIVDAW RDRVAGNPGR TAVRYFDGSL 
SAAELDAHTD ALAAELQERD VRPGERVGVY LQNVPHYPIS LLAIWKAGGI AVPLNPMYRG
RELRRLIDDS GTTGIILARG TDAQTRETLA GSTVRWLLSA SARDFQTADD PRVFGAAGDP
APSPDGDLAA ILSERRGQHP EPVETALDDT AFLTYTSGTT GPPKGALNTH RNCLNSVLNY
GRWLLLQPGD VVFAIAPLFH ITGLSLNAGI ALLNDTTLSM SGRFEPSVVL EAFRDHGVTT
TIGSITAFNA FFRVDGAGPE HFAAVKRLYS GGAPIPPSTV EAFRSRFGPY LHNIWGMTET
TGGGIAVPPG AAAPVHGPSG TLSIGVPMQN VDVWITDESG APRPPGVEGE LVISAPQVIP
GYWRNPEASA HALAGGRLRT GDVAVLDAAG WVYLVDRVKD QINTSGFKVW PREVEDVLYE
HPDVFEAAVV GLPDAYRGET VAAYVSLRDG AATTPEELTA FARERLAAYK YPRRISILPE
LPKTATGKIQ RAVLREQAPA GQVIPALPRA EPG
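
The relationship between the two sequences can be spot-checked by translating a few codons. The sketch below builds a codon table by hand rather than using a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The `translate` helper is hypothetical, and only the first and last codons of the gene are used.

```python
from itertools import product

# Codon table in TCAG order; "*" marks a stop codon.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
gene_start = "ATGCCGACCG ATTCCGCGAG CGAGGACGTC"
gene_end = "GAACCCGGCT GA"

print(translate(gene_start))  # MPTDSASEDV
print(translate(gene_end))    # EPG
```

These fragments match the first ten and last three residues of the protein sequence listed above.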