Gene Franean1_0056 details


Gene Information

Locus tag: Franean1_0056
Symbol:
ID: 5668482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 69424
End bp: 72531
Gene Length: 3108 bp
Protein Length: 1035 aa
Translation table: 11
GC content: 72%
IMG OID: 641238985
Product: lantibiotic dehydratase domain-containing protein
Protein accession: YP_001504430
Protein GI: 158311922
COG category:
COG ID:
TIGRFAM ID:
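
The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(69424, 72531)
print(length_bp)                  # 3108
print(protein_length(length_bp))  # 1035
```

Both results match the Gene Length and Protein Length fields listed in the record.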


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCATGC CCCCGCGCTA CCAGCACACC GGCTTCATCC AGGTCCGCGC CTCGACGGAC 
CCGGGCGGCC TCGACCTGCC GACTGATCTC GATCTGTCCG ACTCCGCTGC GGTCGAGCAG
GAAGGCCGGG CATGGCTGGC CAAGACCTGG GCTCGCCCGG AGGTACGGGA GGCGCTGCGG
CTTGCCAGCG CCGATCTCTG CGCCCGCATC GAGCAGATCC TTGACGACGG CGCCGGGCGG
CGTTCCGGGG CGGAGATCCA GCGTGCCCTC ACCGCGGTGG CGTCCTACCT GCTGCGGTGG
CAGCGGCGCG CGACGCCGTT CGGAATGTTC GCCGGCATCA GCACCGCGAC CGCCGGGCCG
GCAACCGCGG AGGTGGGACG CGCACATCAA GCCGTGGCAC GCGCGGACGC CGAGTGGGTC
ACGGCGCTCG CCGATCAACT CGAACGACAT CCGGAGCTGC GTGGGCGGTT GACGGTGGTG
ACCGACAGTG CGCGGATCGT GCGCGACGGT CGGGTCATCG TGCACCGGCG GGCCGAGGTG
GGCGCCCCGA GCCCGGGGCC GCTGCGGGAG TCGTCGGTAC GGCTGACCCG GCCGGTCTTC
TTCGCCTTGG CGGAGGCCGG CAACCCGATC CGGCTGGACG CGCTCGTCGC GCGGACGGCG
ACCAGGTTCC CCACCGCCTC CCCGGACAAG ATCCGCGCGC TGGTGCATGG CCTGGTCGAC
GGCGGCCTCC TGATCACGAG CCTGCGACCA CCGACGACAG GCGTCGATGC CCTCACCCAC
CTGGCCGACG CACTACGTGC CGCAGGCCTC GAGGGCCTGG CTGACGTGAG TGACCTGCTG
CGGGAAATCG ACTGCCTCCG CGCCATGCTG GCCGCCCACA ACCGCATCAC TGATCCGCAG
CCGGCGGCGC AGACGCGCCG GACGGTCGCC GCGCGGATGG CCTCGCTTGC ACCGGGAGCA
GGTCAGATGC TGGCGGTCGA CACGCGGCTC GACGCGAACA TCGCCGTCCC CGGGCGGGTC
CTCACCGAGG CAGCGCTGGC GGCGAGCGTG CTGCTGCGAC TGTCCCGCCA GCCCTTCGGC
TCGACAGCCT GGCTGGACTA TCACGCCCGG TTCCGGGCCC GCTACGGCCC CGGTGCCCTG
GTGCCGGTGC GGGAGCTGGT CGCCGAATCG GGGCTGGGCT ATCCCGGCGG GTACCTCGGT
GCGCCGCGAG CGAGACCTGC CTGGCGGATG CTGACCGACC GTGACACCGC GCTCCTGGCG
CTGATCCAGC GGGCCGCCGT GGACGGGGCC GAGGAGATCG CGTTGACCGA CGCGGACGTC
GAGGAACTGG CTGTCGGGGA GCGCGGGGAC GTCGTGCCGC CGCAGCGGGT CGAGCTCGGC
GTCGAGGTGT GGGCGGTCTC GACTCGCGCG ATCGACCGCG GGGACTTCCG GCTGCGGGTC
ACCGCCGCCC CGCGGACCGG TACCAGCATG GCCGGACGGT TCGCCTACCT GCTCGGCCCA
GCCGACCGCG ACCGGCTGGC CGCGACGTAC GCGGCCAGTG CTGAGGCTGG GGAGCGGGAC
CAGCCCTTGG CGGCACAGCT GTCCTTCCCG CCGCGTCGCC CGCACAACGA GAACGTGGTG
CGCGTCGAAC CGCTGTTGCC GGCCGTCATC TCTCTCTCGG AGCATCCTGA CCCCGCCCAC
GCCAAGGTCA GGTTGATCGA TCTGGATGAC CTGGTGGTTA CCGCCGACGC CGCGCAGATG
TACCTGGTGC AGCGCTCGAC CGGCCGGCGG GTGATCCCTC GGATTCCGCA CGCGCTGGAC
ACCAGCGTGC AGAGCCCACC GCTGGCCCGT TTCCTCGCCG AGGTCGCCGA CGCCCGTACC
GCCGTGTTCG GCGGGCTCGA TCTCGGCGCG GCCCGTGTCC TGTCCTACAT CCCGCGCATC
CGCTACGGCC GCACGGTCCT GGCCGTCGCC CGCTGGACGC TCACCTCCAC CGACCTGCCC
CGCGGCCAGT CCGGCGACGG AAGGCAGGAA GCGCTGTGCG CCTGGCGGCA GCGGTGGCGT
GTGCCCGCAC GGGTCGTGCT CTGCCACAGC GAGCTTCGCC TACCCCTCGA CCTGGATCGC
CACCTGGACC GGGTGCTGTT CCTGACCCGC CTGGAGCGAG CCGGCCGCAT CGAGGTGCAG
GAAGACGGTC CGCCCGACGC GCAGGGCTGG ATCGGGCGTC CGGCCGAGCT GCTGATCCCG
TTGACCGCGA TCAGCCCACC GGCACGGCCA CTGCCCGTGA CGGCAGCGCC GGGGGCGGTG
CTTCGTCCGG GTGACGCGGC CGTTCTACAC GCGCAGCTTG TCGGAAACCC GGCCCGGTTC
GACGACATCC TCACCGGCCA TCTGCCGAGG CTCGCCGACA GCCTGGAAGG GCTGGTCTGT
CTCTGGTGGG TCCGACGACA CCGCGACATG ATCCTTCCCG AGAGTGATCA GCACCTCGCC
GTGTTCCTAC GCCTGCGCAG CACAGACCAC TACGGGCCGG TTGCCGCCGC GGTGGCCTCC
TTCGCGGCCG ATCTGGAGAC TCGCGGCCTT CCCGGTCAGC TCACCCTGGC TTCCTCCCCG
CAGCACCCCG CCCGCTACGG CGACGGTGAC GCGCTGGCCG CCGCCGAGTC GGTGTTCGCC
GCCGACACCC TCTGCGCCAC CGCCCAGATC ACGGCGGCCC AGACGTCCGG GATCTCCGCG
CAGGCCCTGG CCGCAGCATC GATGGTGGAC CTGGCCGCGG CCTTCGCCCC CGACCCGCTG
GCCGGGTACC AGGCACTCGT CGGATGCCTT CGGCAGGAGC ACGGCGCCCT GGACCGCACC
CTACGGGACC AAGCTCTCGA CCTGGCCGAT CCCGGCGGTA ACCACCAGGC GGTTCGGGCC
CTGACCGGCG GTGAGGCAGT CGTCAGCGTC TGGAACGCCC GAGCCATCGT GCTGGCCTCT
TACTACCAGG CCCTCGCGCG GCAGCGTGAT CCGCGGACAG TGCTGCGGAC CCTCCTGCAC
GAGCACCATG TGCGCGCTGT CGGTGTGGAT CCGACCTTGG AGAAGGAGAC CGGGCGCCTC
GCTCGCGCTT CCGCACTGCG CCACCTCGCG CTGGCCGGTG CCCGATGA
 
Protein sequence
MPMPPRYQHT GFIQVRASTD PGGLDLPTDL DLSDSAAVEQ EGRAWLAKTW ARPEVREALR 
LASADLCARI EQILDDGAGR RSGAEIQRAL TAVASYLLRW QRRATPFGMF AGISTATAGP
ATAEVGRAHQ AVARADAEWV TALADQLERH PELRGRLTVV TDSARIVRDG RVIVHRRAEV
GAPSPGPLRE SSVRLTRPVF FALAEAGNPI RLDALVARTA TRFPTASPDK IRALVHGLVD
GGLLITSLRP PTTGVDALTH LADALRAAGL EGLADVSDLL REIDCLRAML AAHNRITDPQ
PAAQTRRTVA ARMASLAPGA GQMLAVDTRL DANIAVPGRV LTEAALAASV LLRLSRQPFG
STAWLDYHAR FRARYGPGAL VPVRELVAES GLGYPGGYLG APRARPAWRM LTDRDTALLA
LIQRAAVDGA EEIALTDADV EELAVGERGD VVPPQRVELG VEVWAVSTRA IDRGDFRLRV
TAAPRTGTSM AGRFAYLLGP ADRDRLAATY AASAEAGERD QPLAAQLSFP PRRPHNENVV
RVEPLLPAVI SLSEHPDPAH AKVRLIDLDD LVVTADAAQM YLVQRSTGRR VIPRIPHALD
TSVQSPPLAR FLAEVADART AVFGGLDLGA ARVLSYIPRI RYGRTVLAVA RWTLTSTDLP
RGQSGDGRQE ALCAWRQRWR VPARVVLCHS ELRLPLDLDR HLDRVLFLTR LERAGRIEVQ
EDGPPDAQGW IGRPAELLIP LTAISPPARP LPVTAAPGAV LRPGDAAVLH AQLVGNPARF
DDILTGHLPR LADSLEGLVC LWWVRRHRDM ILPESDQHLA VFLRLRSTDH YGPVAAAVAS
FAADLETRGL PGQLTLASSP QHPARYGDGD ALAAAESVFA ADTLCATAQI TAAQTSGISA
QALAAASMVD LAAAFAPDPL AGYQALVGCL RQEHGALDRT LRDQALDLAD PGGNHQAVRA
LTGGEAVVSV WNARAIVLAS YYQALARQRD PRTVLRTLLH EHHVRAVGVD PTLEKETGRL
ARASALRHLA LAGAR
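
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it builds the codon table from the amino acid string that NCBI lists for translation table 11 (whose amino acid assignments are the same as the standard code), strips the spaces used to group the sequence into blocks of ten, and stops at the first stop codon. The helper names clean, translate, and gc_fraction are hypothetical, and the example runs on the first and last rows of the gene sequence rather than the full 3108 bp.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_row = "ATGCCCATGC CCCCGCGCTA CCAGCACACC GGCTTCATCC AGGTCCGCGC CTCGACGGAC"
last_row = "GCTCGCGCTT CCGCACTGCG CCACCTCGCG CTGGCCGGTG CCCGATGA"

print(translate(first_row))  # MPMPPRYQHTGFIQVRASTD
print(translate(last_row))   # ARASALRHLALAGAR
```

Passing the full gene sequence to gc_fraction gives a value that can be compared with the GC content field of the record.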