Gene Franean1_1289 details

Gene Information

Locus tag: Franean1_1289
Symbol:
ID: 5669702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1555208
End bp: 1556572
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 74%
IMG OID: 641240221
Product: galactose-1-phosphate uridylyltransferase
Protein accession: YP_001505649
Protein GI: 158313141
COG category: [C] Energy production and conversion
COG ID: [COG1085] Galactose-1-phosphate uridylyltransferase
TIGRFAM ID: [TIGR00209] galactose-1-phosphate uridylyltransferase, family 1
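The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1555208
END_BP = 1556572


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1365 454
```

Under those assumptions the results agree with the listed Gene Length (1365 bp) and Protein Length (454 aa).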


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGCTGGTCT CGGTTTTCCC GTCCGATGAC GGTCCTGGTC CGGACGATCA GCCCGGGCCG 
GGGGGCGCCG GCGACCGGCG CGGCGAGACG CCCTCGCCGA ACGGGCAGGC TGCGGCCGGG
CCCGCGGCGC ACCGGCGGCA CGATCCGCTG GGCGATCGCT GGATCATCGT CTCGGCGGGC
CGGGTGGCGC GCCCGTGGCG CGGCGGCCAG GAAGTGGTCG CCACCGCCGT GGGGACCTAC
GACCCCGAAT GCCATCTCTG CCCGGGAAAC GTGCGCGCCT CCGGCAAGGC GAATCCCGAC
TACGACGGCG TTTACGTGTT CGACAACGAC TTCCCGGCGC TGCGCCCCGA GCCCGCGGAC
CTGGCCGCGT CTCCCGATCA GGCGGTGTCT CCCGATCAGG CGGTGTTTGC TGATCCGTCC
GGGCTGGATG CGCGGGCGGC GGGGCTGCTG CACGCCGAGC CGGGCATCGG CACGTGCCGC
GTCGTCTGCT TCGACCCGGG CCACGACCAG TCACTGGCGA GCCTGGGGCT CGACCGGGTG
CGGGCGGTGG TCGACACCTG GGCCGACCAG GAGCGTGAGC TCGGCCGGAC CTGGAACTGG
GTGCAGATCT TCGAGAACCG CGGCGCCGCG ATGGGGGCGT CCAGCCCGCA CCCGCACGGG
CAGATCTGGG CGTCGTCGTT CCTGCCCGAC CTCGCCGCGG TCGAGGACCG CACCCAGCGC
GACCATCTAC GGTCCAGGGG CACCTCGCTG CTGGTCGACT ACGCGGCGCT GGAGGCTGAG
CTCGCCGCCG CGGATGACGG CGCCAGCACC GCCGCCGCCA GCAGAGTCGC TGCTGGCAAG
GCCACCAACG GCAAAGCCAC CAATGGCAAA GCCACTGACG GCGTGGACGG CCCCGGCCGG
TCCGCGAGCC GGGTGGTCGT GGCGAACGAC CACTGGCTCG TCGTCGTCCC CTACTGGGCG
TTCTGGCCGT TCGAGACGCT GGTGCTCCCG CGCCGCCCGG TCGGCCTGCT CCAGCACCTG
ACCGACGCCG AGCGCGACGC GCTGGCACAG GCGCTGCGGT CCCTGCTCAG CTGCTACGAC
GCGCTGTTCG ACTCGCCCTT CCCCTACTCG ATGGGCTGGC ACGCGGCGCC CGGCGCGCGC
CCGGATCCCG ACGCGCCGGT GCCCGCGCAC TGGCAGCTGC ATGCCCACTT CCACCCGCCG
CTGCTGCGGT CACCGACCGT CCGCAAGCAC CTGGTGGGCT ACGAGATGTT CGCAGGCGTG
CTACGCGACA TCACCCCGGA GGACGCCGCC GCCCGGCTGC GGGCCGCCGG GGCGCGCGCG
GAGAAACGGA CAACAGAGCC CACACGGGGG CTCATGCCGG GATGA
 
Protein sequence
MLVSVFPSDD GPGPDDQPGP GGAGDRRGET PSPNGQAAAG PAAHRRHDPL GDRWIIVSAG 
RVARPWRGGQ EVVATAVGTY DPECHLCPGN VRASGKANPD YDGVYVFDND FPALRPEPAD
LAASPDQAVS PDQAVFADPS GLDARAAGLL HAEPGIGTCR VVCFDPGHDQ SLASLGLDRV
RAVVDTWADQ ERELGRTWNW VQIFENRGAA MGASSPHPHG QIWASSFLPD LAAVEDRTQR
DHLRSRGTSL LVDYAALEAE LAAADDGAST AAASRVAAGK ATNGKATNGK ATDGVDGPGR
SASRVVVAND HWLVVVPYWA FWPFETLVLP RRPVGLLQHL TDAERDALAQ ALRSLLSCYD
ALFDSPFPYS MGWHAAPGAR PDPDAPVPAH WQLHAHFHPP LLRSPTVRKH LVGYEMFAGV
LRDITPEDAA ARLRAAGARA EKRTTEPTRG LMPG
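
Both sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of one way to strip the block formatting and compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed GC value is for that line alone and not for the whole gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (including trailing space).
first_line = "GTGCTGGTCT CGGTTTTCCC GTCCGATGAC GGTCCTGGTC CGGACGATCA GCCCGGGCCG "

seq = clean_sequence(first_line)
print(len(seq))                    # prints: 60
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` would give a value to compare with the GC content listed under Gene Information.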