Gene Franean1_1494 details


Gene Information

Locus tag: Franean1_1494
Symbol:
ID: 5669898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1794993
End bp: 1796144
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 74%
IMG OID: 641240414
Product: bifunctional uroporphyrinogen-III synthetase/response regulator domain protein
Protein accession: YP_001505840
Protein GI: 158313332
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1587] Uroporphyrinogen-III synthase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.074261
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGACT CCGGCCCGGT GACGGCGCCC GTCGAGCCGC TCGCCGGTTA CACGGTCGCG 
CTCACCGCGG CGCGCCGGCG TGAGGAGTTC GGCGCGGCCC TGGAGCGACG CGGCGCCAAG
GTCGTCTACG CCCCCGCCAT ACGCATCGTG CCGCTCGCGG ACGACGCCCG GCTGCGGGAG
GCCACCGAAC GCTGCATCGC CGCACCCGTG GACGTCGTCG TCGCCACCAC CGGAATCGGC
TTCCGGGGCT GGGTCGACGC GGCCGAGACG TGGGGTCTCG CCGACCGGCT GGTCGCGGCG
TTCGAGTCGG CGGACCTGCT GGCACGCGGG CCGAAGGCGC GCGGTGCGAT CCGGGCAACC
GGGCTGCGTG AGGCGTGGTC ACCCGAGTCG GAGTCATCCT CCGAGGTCAT GTCATACCTG
ACCGCCCACG GTGGCCTGGA CGGCAAGCGG ATCGCGGTGC AGCTGCACGG CGAGCCACTG
CCGGACATGG TCCAGACGCT GTGCGCGGCA GGCGCCGAGG TCATCGAGAT TCCCGTCTAC
CGGTGGGTCC CGGCGCAGGA CATGGCTCCC GTCCGCCGCG TGGTGGAGTG CGTCGCGGCA
CGGTCGCTGG ACGCGGTCGC GTTCACGAGC GCGCCCGCCG CCGCGAGCTT CCTGCAGACA
GCCGACGAGA TGGCGCTGCG ATCCGCCGCC GCGGAGGCGA TGCGCGGCCC GGTTGTCGCC
GCCTGCGTCG GCCCGGTGAC GGCCGCTCCC CTCGGGCGGG CTGGGATTCC GTGCGTGATC
CCGTCGCGGG GACGTCTGGG CGCGCTTGTC CGGGAGATCG TGGAGCAGGT GCCCATCCGG
CGTGGCCTGC GACTGCGCGT CGGCGAGCGC GCGCTGGACG TCCGTGGCCA CGCCGTGGCC
GTCGACGGTG TGCTCGTCGC GCTGCCCGCC GCCTCGATGA CGCTGCTGCG CGCCCTGGCG
GCCAGGCCTG GCTATGTCGT CTCCCGGGCG GACCTGCTCA ACCTGACCGG CACGACCGAC
GAGCACGCGC TCGAGGTCGC CGTGGGCCGC CTGCGCACGT CGCTCGGCGA CCCCGCCCTC
ATCCGGACCG TGGTGAAGCG CGGATACCGG CTCGACTGTG AGCCGGTATC CGCCTCCTCC
GGGTGCCTCT AG
 
Protein sequence
MTDSGPVTAP VEPLAGYTVA LTAARRREEF GAALERRGAK VVYAPAIRIV PLADDARLRE 
ATERCIAAPV DVVVATTGIG FRGWVDAAET WGLADRLVAA FESADLLARG PKARGAIRAT
GLREAWSPES ESSSEVMSYL TAHGGLDGKR IAVQLHGEPL PDMVQTLCAA GAEVIEIPVY
RWVPAQDMAP VRRVVECVAA RSLDAVAFTS APAAASFLQT ADEMALRSAA AEAMRGPVVA
ACVGPVTAAP LGRAGIPCVI PSRGRLGALV REIVEQVPIR RGLRLRVGER ALDVRGHAVA
VDGVLVALPA ASMTLLRALA ARPGYVVSRA DLLNLTGTTD EHALEVAVGR LRTSLGDPAL
IRTVVKRGYR LDCEPVSASS GCL
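
The figures in this record can be cross-checked with a few lines of arithmetic. The Python sketch below is illustrative only and is not part of the database. The helper names (`clean`, `gc_fraction`, `span_length`, `expected_protein_length`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon. Both assumptions are consistent with the values listed above: the gene sequence ends in TAG, and the protein begins with M although the gene begins with GTG, an alternative start codon under translation table 11.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay the sequence out in 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus the terminal stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information table.
assert span_length(1794993, 1796144) == 1152   # matches "Gene Length"
assert expected_protein_length(1152) == 383    # matches "Protein Length"

# GC fraction of the first 10-base block of the gene sequence only.
print(gc_fraction("GTGACCGACT"))  # 0.6
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the reported GC content of 74%.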