Gene Franean1_7005 details


Gene Information

Locus tag: Franean1_7005
Symbol:
ID: 5675316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8540033
End bp: 8541691
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 68%
IMG OID: 641245851
Product: hypothetical protein
Protein accession: YP_001511242
Protein GI: 158318734
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes
TIGRFAM ID:
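
The gene and protein lengths listed above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 8540033
end_bp = 8541691

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1659 0 552
```

The result agrees with the listed Gene Length of 1659 bp and Protein Length of 552 aa.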


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0904061
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.234272
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCACG CGCTTCACCA CGCCCTTGAG CCGAGTGCCG AATACAATGT CGACCTGACG 
GTGACTGCCG GGCGCTGGCC CGACGGGCTG GCCGGGAACA TCTTCGTCAT CGGGCCGTCG
CAGCCGACCG CCGTCGACTT CATGTTCGCC GGGCCGGGCC TGCTCACCCA TGTGGACCTC
GAGACCAGGC ACTGGCGGAC GAAGCGGGTC GTCACGCCCG ACCTGGCTCT TCTCGGCGGC
CTCAGCCGGG CGTTGCCACC GGCCGAGCTG GCGGGACTGA CCGTCGGCGG ACGGCCGTCC
CTGACGAACG TGTCGCCGCA CTTCTTCGGC GACCGGCTAC TGCTCACCGG GCTGAGCCAG
CGGCCGGTCG AGTTCGACCC GGCGACTTTG GAGTTCAAGA CGTTTCTCGG CGGGGTCAGC
GAATACCCCG AGGTCGTCGC GCATCCCCTG TTCCCCGGTG TGCGGACGGC GGCGCACCCG
GTCGAGGACC TCGACGACGG TTGCATGTGG TGGTGCAACA CGAACCTCCG TCCCCGGGGT
TCGTCCACCT CCGACATGGA GGGACCCATG TGGGTGGTGC GCTGGGACGG GCACGGCGAC
GTCGAGACGT GGCACGTACC CGGCGCGCAC CTGACCCAGG GCGTGCACGA GATGACAGTG
ACCCAGGACT ACGTGATCTT CACGGAGATC GGGTTCCAGC CCGAGCCCGG CACCGTCGCC
GGACGCGGCC GCACCAAACC GCATCTGCCC TTCACCGACA TCTATCTGGT GGGCAAGCGC
GACCTCACCG TCGCCCGGCG GGGCCGCAGC GTGCCGGTGG CCCACGCCCG CGTCCCACGC
GAGTCGTTCC ACCACTTCGC CGACTACCGC CAGGACGGCG ACGACGTCAC GATGTACCTC
GCTCATTCGA ACGGCTGGGA TCTCAACTAC GTGCTCACCG ATGCCGACAG CGTCTGGGGA
ACCGGCTCCG GCCTCGCCAA GGGCCTGCAC GGCTTCGTCT CCGCCCCGGT GGACGCTTCA
CCGGTCGGCC GCTACGTGAT CGACGGCCGA ACCGGTGAGG TCAGGGACAG CCATGTCTTC
CTCGACCCGG AACGCCACTG GGCCACGCTT CTCTATGGCC GTGACATGCG GCGGCCGGGG
CTCGAGCGCG GCCGCTACCT GTGGCAGTCC TACTGGGGAT GCGACACCGA GATGCTGGCG
ACCCGGATCG TCGAGATGTA CCGGGATCAC CCCTATCGCG TTGTCCCGGT GGACCAGTTG
CCGTCACGGG AGATCCCGTC GTCCCTGGTG TGCATTGACC TGGAGACGAT GACCGAACAG
TCGGCCTGGT CATTCCCCGC CGGCACCACC AGCGAGTCAC CCGTGTTCGT TCCCGACCCC
GCGGGCGGCC CCGGCTGGGC GGTGATTTTT GTCCACTACT CCGACCGGAC CGAACTTCAG
GTGTTCGACG CCCTGGCTCT CGGCGCGGGG CCCGTCGCCG TCGCCACCGC CGAGGGCCTC
AAACTGTCCG TCCAGTTCCA CTCCGCTTAT CTGCCCAGCA TCCGTCTACG TGACACGGGC
TACGAACGTT CGTTCGCGGC CGACCTCGGC GACGGCTGGC GGGATTTCTC GCCGTCCGCC
CGCGGTGTGA TCAGTAAGGT GCTCGAGCGG TACGGCTGA
 
Protein sequence
MSHALHHALE PSAEYNVDLT VTAGRWPDGL AGNIFVIGPS QPTAVDFMFA GPGLLTHVDL 
ETRHWRTKRV VTPDLALLGG LSRALPPAEL AGLTVGGRPS LTNVSPHFFG DRLLLTGLSQ
RPVEFDPATL EFKTFLGGVS EYPEVVAHPL FPGVRTAAHP VEDLDDGCMW WCNTNLRPRG
SSTSDMEGPM WVVRWDGHGD VETWHVPGAH LTQGVHEMTV TQDYVIFTEI GFQPEPGTVA
GRGRTKPHLP FTDIYLVGKR DLTVARRGRS VPVAHARVPR ESFHHFADYR QDGDDVTMYL
AHSNGWDLNY VLTDADSVWG TGSGLAKGLH GFVSAPVDAS PVGRYVIDGR TGEVRDSHVF
LDPERHWATL LYGRDMRRPG LERGRYLWQS YWGCDTEMLA TRIVEMYRDH PYRVVPVDQL
PSREIPSSLV CIDLETMTEQ SAWSFPAGTT SESPVFVPDP AGGPGWAVIF VHYSDRTELQ
VFDALALGAG PVAVATAEGL KLSVQFHSAY LPSIRLRDTG YERSFAADLG DGWRDFSPSA
RGVISKVLER YG
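
The GC content and the protein sequence can both be derived from the gene sequence. The sketch below is one way to do that in plain Python, assuming the sequence is pasted in with its 10-base spacing. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table holds the amino acid assignments of NCBI translation table 11, and the sketch does not handle alternative start codons or ambiguous bases.

```python
# Amino acid assignments for NCBI translation table 11, codons ordered
# by first, second, third base over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGAGTCACG CGCTTCACCA CGCCCTTGAG CCGAGTGCCG AATACAATGT CGACCTGACG"
print(translate(first_line))  # MSHALHHALEPSAEYNVDLT
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison against the listed protein sequence and GC content.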