Gene Franean1_0169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0169 
Symbol 
ID5668594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp202782 
End bp203795 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content77% 
IMG OID641239098 
Product2-amino-4-hydroxy-6- hydroxymethyldihydropteridine pyrophosphokinase 
Protein accessionYP_001504542 
Protein GI158312034 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0801] 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase
[COG1539] Dihydroneopterin aldolase 
TIGRFAM ID[TIGR00525] dihydroneopterin aldolase
[TIGR00526] FolB domain
[TIGR01498] 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0060964 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.143037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTC CGGCCGGCGG TGTCCCGACC GGCGGTGTAG CGGCAGACGC GGCCCGGCTC 
GACCGGATCG CCCTGACCGG CCTGCGCGTG CACGGCCGGC ACGGGGTGCT GCCCGCGGAA
CGGGAGCTCG GGCAGTTCTT CGTCGTCGAC GCCACGCTCT GGCTGGACAC CGGCCCGGCG
GCGGCCACCG ACGACCTCAC CCTCACCGTG CACTACGGCG AGCTGGCGCA GGCCCTCGCC
GACATCGTCG CCGGCGAGCC CGTCGAGCTG ATCGAGACCC TGGCGCACCG GCTGGTCCGG
GCCTGCCTGG CGGCCGACCC GCGGGTGGCC GCCGCCGAGG TGACCGTGCA CAAGCCCGCC
GCCCCGATCA CCGTCGCGTT CACGGACATC GCCGTGACGG TGGCCCGCTC CCGGGACGCC
GCCGTGACCA CGGGCCCTTC CCGGGAGGTG GCCGCGACGT GCTCCCGGGA CGTCGCCGCG
GCCGCCGGGC CGTCCGCCGC GGGCGCGAGT GCGAGTGCGG GGCACGCGGT GGTCGCCATC
GGTTCCAACA TCGGCGACCG GCTCGCCCAC CTGCGGGGCG CCGTCCGCGA CCTGGACGCC
AGGCTGGGCG TGCTCGCCGT CTCCGCCGTG TACGAGACCG AGCCGGTCGG CGGCCCCGAG
CAGGACGACT ACCTCAACGC GATCGTCCTG GTCGGCGCCG CGCCGCCGCG CGACCTGCTC
GCCGCGGCCC GCGCCGCCGA GCGGACCGCC GCGCGCGAGC GCACCATCCG CTGGGGCCCC
CGCACGCTGG ACGTCGACAT CATCGCCTGC GGCGACACCC GCAGCGACGA TCCGGAGATC
CTGCTGCCGC ACCCGCGGGC GCACGAACGC CTGTTCGTCT GCGTCCCCTG GCTGGACGTC
GAACCGGACG CGATGCTGCC CGGGCACGGC CGGGTGGCGG ACCTGGTCGC GCGCCTCTCG
GTGGTGCCGG GAGCGGCCGC GCTCCGGCGC ACCCCCCACG GGCTCGGTCG GTGA
 
Protein sequence
MTAPAGGVPT GGVAADAARL DRIALTGLRV HGRHGVLPAE RELGQFFVVD ATLWLDTGPA 
AATDDLTLTV HYGELAQALA DIVAGEPVEL IETLAHRLVR ACLAADPRVA AAEVTVHKPA
APITVAFTDI AVTVARSRDA AVTTGPSREV AATCSRDVAA AAGPSAAGAS ASAGHAVVAI
GSNIGDRLAH LRGAVRDLDA RLGVLAVSAV YETEPVGGPE QDDYLNAIVL VGAAPPRDLL
AAARAAERTA ARERTIRWGP RTLDVDIIAC GDTRSDDPEI LLPHPRAHER LFVCVPWLDV
EPDAMLPGHG RVADLVARLS VVPGAAALRR TPHGLGR