Gene Francci3_2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2454 
Symbol 
ID3905066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2861770 
End bp2862837 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content70% 
IMG OID637879784 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_481550 
Protein GI86741150 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.19659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATTC ATTCGATCGA CCACATTGAG CTGTTCGTCG AGGACGCCGC GCAGGCTGCG 
GCCGAACTGT GCGATTCCTT CGGTTTCACC GTCACCGGCC GAGGCGGGCC GCGCACCGGG
CTCAAGGGCT GCGAGTCGGT GCTGCTGCGG CAGTGCGACA TCACTGTCGT CGTCACCGCC
GCGACCAGCT CCGACCACCG CGCCGCCGAG TTCGTCCGTC GGCACGGCGA CGGAGTGGCG
GTCATCGGCT TCGCCGTCGA CCAGGCGCAG GCCGCGTTCG CCGAGGCGGT GAATCGTGGA
GCGGTGCCGG TCACGCCGCC CGAGACCCTG GGAACGCCGG GCGGCCGCGT GACCTTCGCG
TCGGTGGCCG GGTTCGGCGA CGTGGAGCAC CGCTTCACCT CCCGGGAGGC GGTCGAGGGG
CCCTTCTCGC CCGGCCTCAT CGAGGAGACC GTCCCAGACC GCTCCAACGA AGGCCTGCTC
AGGGCCATCG ACCACGTCGC GGTCTGCCTG CCCGCCGGCG AACTGCACCC GACCGTACGC
GCCTATCGGG ACGTGTTCGG CTTCACCCGG ACCTTCGAGG AGCGCATCGT GGTCGGCTCT
CAGGCCATGG ACTCCCAGGT GGTGCGCAGC CCGTCCGGCA AGGTCACCTT CACCATCATC
GAACCGGACA CCACCCGCGC CCCCGGCCAG ATCGACGAGT TCGTCCGCTC GCACGGCGGG
GCGGGAATCC AGCACATCGC GTTCCGCACC GACGACATCA CGGCGGCGGT CCGGGACAGC
GCGAAGCGCG GGGTGCGGTT CCTCACCACC CCGGCGAGCT ACTACGAGGC GCTGCCGGCG
CGGCTCGGCC CGGTCGGCGT CCCGGTGGAG ACGCTGCGTG AGCTCAATAT CCTGGCCGAT
CGCGACCACG GCGGCGTCAT GCTGCAGATC TTCACCGCGT CCCGGCACCC CAGGCGGACC
TTTTTCCACG AGCTGATCGA CCGCCGCGGC GCCCACACGT TCGGCAGCAA CAACATCAAG
GCCCTGTACG AGGCCGTCGA ACGCCAACGG GCCGCCGAGA GCGCCTGA
 
Protein sequence
MDIHSIDHIE LFVEDAAQAA AELCDSFGFT VTGRGGPRTG LKGCESVLLR QCDITVVVTA 
ATSSDHRAAE FVRRHGDGVA VIGFAVDQAQ AAFAEAVNRG AVPVTPPETL GTPGGRVTFA
SVAGFGDVEH RFTSREAVEG PFSPGLIEET VPDRSNEGLL RAIDHVAVCL PAGELHPTVR
AYRDVFGFTR TFEERIVVGS QAMDSQVVRS PSGKVTFTII EPDTTRAPGQ IDEFVRSHGG
AGIQHIAFRT DDITAAVRDS AKRGVRFLTT PASYYEALPA RLGPVGVPVE TLRELNILAD
RDHGGVMLQI FTASRHPRRT FFHELIDRRG AHTFGSNNIK ALYEAVERQR AAESA