Gene Franean1_0805 details


Gene Information

Locus tag: Franean1_0805
Symbol:
ID: 5669221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 942152
End bp: 943462
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 77%
IMG OID: 641239733
Product: nucleoside triphosphate pyrophosphohydrolase
Protein accession: YP_001505169
Protein GI: 158312661
COG category: [R] General function prediction only
COG ID: [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain
TIGRFAM ID: [TIGR00444] MazG family protein
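
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    span_bp = end_bp - start_bp + 1
    if span_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = span_bp // 3 - 1  # drop the stop codon
    return span_bp, residues


# Values copied from the Gene Information fields.
gene_bp, protein_aa = cds_lengths(942152, 943462)
print(gene_bp, protein_aa)  # prints "1311 436"
```

Under these assumptions the result matches the reported gene length (1311 bp) and protein length (436 aa).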


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0143961
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCACCC GCATCACCCT GGTCTCGACC AGCGCCCGCG TGGCCCCCGG CCTGCTCACC 
GCCGCGGCCT GGGACGTGCT CCGCTCGGCC CGGGTCTGGA CGGCGAGCCC GGAGCATCCC
CAGGCCGCGG CGCTGCGTGA GGCCGGCGTC GGCGTCTCGG TGCTGCGCCC CGCTCCGCCG
TCCGAACCCG GCGTGGGGCT GTCGGCCGAG GCCGGTGTGG TGCCGTCGGG CCGGGCGGAG
GCTGATCCGG TCGCCGAGCT GCGGGCGGTG GCCGGGCCGG GCACCCATGT CGCGTGGCTG
CTCGACCCGG TCTCGCCGAC AGCCGCCGAC CGCGCGCTGC GCGCCGCCCT CACCGACCAG
GACCACCCGG CGGAGCCGGG GGCCGTCGTC GAACTGCTCG TCGCCACCCG CGAGCTGCCC
GGGTCGGCGC TGCTCGACGC GGTCGCGGTC ATGGACCGGC TGCGCTCGCC CGGCGGCTGC
CCCTGGGACG CCGAGCAGAA CCACGTCTCG CTGGCCCCCT ACCTGCTCGA GGAGGCCTAC
GAGGCCTACC AGGCCATCGA GGACGGGGAT CTCGCGGAGC TGCGCGAGGA GCTGGGCGAC
GTCCTGATGC AGGTGCTCTT CCACGCGCGG ATCGCCGCCG AGTCCGGCGG GGCGGGCTGG
GACGTCGACG ACGTCGCGGC CGGGCTGACC GCCAAGCTGA TCCGCCGCCA CCCGCACGTG
TTCGGTGACG TCGCCGTCTC CGGCGCGGAC GACGTCGTCA CCAACTGGGA TGCGATCAAG
GCCCAGGAGA AGGGCCGGAA GTCGGTGACG GAGGGTGTGC CGCTCTCCGC GCCGGCGCTC
TTCCTGGCCG CCAAGCTGCT ACGGCGGGCC GCGAAGCTCG GGCTCCCGCC GGAGCTGGCC
CTTCCCCGCC CCTCGGCCGA CAGCGGCGTC GGGGATGCCG GGGCCGGTCT ACCCGGCCTC
GTCGCTGCGC TGGCCCGGGA GGTCGGGACC GCTCGCCCGG GAGATCGGGC GTCCGCTGAT
CAGAGTGACG GTCCTGGTAC CGAGGCGGGC ACCACCGCCG AGGAGCGGAT CGGGGACCTG
CTGTTCGCCG CGGTCGTGCT GGCCGGGGAG GAGGGGGTCG ACCCGGAGAC CGCGCTGCGC
GCACGGGCCC GGCTGTTCCG GGACACGCTG GCCCGGGCCG AGCACGCCGC CCTCGCCCGC
GGCGAGGAGC CCCGCGGGCT GGCTGCCGAC ATCTGGCGAT CACTGTGGGT GTCCGCGAGC
GTTCCCCCGG GTGGGCCGGC AGCCACGGAC GGGCCTGTGC ACGGGGCCTG A
 
Protein sequence
MTTRITLVST SARVAPGLLT AAAWDVLRSA RVWTASPEHP QAAALREAGV GVSVLRPAPP 
SEPGVGLSAE AGVVPSGRAE ADPVAELRAV AGPGTHVAWL LDPVSPTAAD RALRAALTDQ
DHPAEPGAVV ELLVATRELP GSALLDAVAV MDRLRSPGGC PWDAEQNHVS LAPYLLEEAY
EAYQAIEDGD LAELREELGD VLMQVLFHAR IAAESGGAGW DVDDVAAGLT AKLIRRHPHV
FGDVAVSGAD DVVTNWDAIK AQEKGRKSVT EGVPLSAPAL FLAAKLLRRA AKLGLPPELA
LPRPSADSGV GDAGAGLPGL VAALAREVGT ARPGDRASAD QSDGPGTEAG TTAEERIGDL
LFAAVVLAGE EGVDPETALR ARARLFRDTL ARAEHAALAR GEEPRGLAAD IWRSLWVSAS
VPPGGPAATD GPVHGA
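
The GC content reported under Gene Information can be recomputed from the gene sequence listing. The sketch below is illustrative only: the helper name `gc_content` is hypothetical, and the example uses just the first 60-base row of the listing, so its result is not expected to equal the gene-wide figure. Pasting the full listing into the same function should allow a comparison with the reported 77%.

```python
def gc_content(seq: str) -> float:
    """Return the G+C fraction of a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored,
    so a listing can be pasted in as displayed.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence listing (60 bases).
first_row = "GTGACCACCC GCATCACCCT GGTCTCGACC AGCGCCCGCG TGGCCCCCGG CCTGCTCACC"
print(f"GC fraction of first row: {gc_content(first_row):.1%}")
```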