Gene Franean1_0750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0750 
Symbol 
ID5669166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp875208 
End bp876269 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content69% 
IMG OID641239677 
Productpolysaccharide pyruvyl transferase 
Protein accessionYP_001505114 
Protein GI158312606 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5039] Exopolysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.759816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000135712 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACGGAGG CCGCGACAGG GGTCCCGTAT GCCTCGACGG CAGCGGTCGC ATCCACTGCC 
CGCCTGATCG AGAGCCTGTC GACGCGGGCC ACGGCCGCCG TCGCCGACCT GCTGCCGCCG
GGGACGACCG ACGTGGCGCT CCTCGACTTC CCGTACCACC GCAACAGTGG TGACTCGGCC
ATCTGGCTCG GCGTACGCCA GATTCTCCGA CAGCTGGGAG TAAGGGTCGC CTACGTCGCG
GACACCGCGC GCTACCGGCC GGACCGGTTA CGGGAGGCAC TGCCGGAGGG GCCTGTCCTG
TTACTCGGCG GGGGGAATTT CGGTGACCTC TGGCCGGGGC ATCAGGAGCT CCGGGTCCGG
GCCCTGCGTG ACTTCCCGGA CCGTACCGTG ATCCAACTTC CGCAGAGTAT TTCCTTCCGT
AGCCGGACGG CGCTTTCCGA GGCCCAGCGC GTGACCGCGG CCCACCCACA TTTCACTCTT
ATGGTCCGTG AACGGCGCAG CCTTTCGTTC GCCACCGAGA ACTTCGATGT GCCGCTCGTG
TTCGCGCCGG ACTCGGCCCT GGCCAACGGC CCGCTGAGAC CGCCGTGGCC GGTACGTCCC
GACGGCGTGC TGTGTCTTGC CCGCGATGAC GTTGAGGGCA CCGGAGCCAT CGCGGGCGTG
GCCGGCCCCG GCATACGTCG GGCCGACTGG GGAATGCGTG GCCTGCCCGC GGCGCGGTGG
AGAGCGGCGA AGGCGCTGCC GAAACTCGAG CGGCATGCCC TGGCCCGGCA TCCGGATGCG
CACCGGGCGG CCCGCCCCGC GCTGCACGCG GCCTATGAAA CAATGGCGAA GACAAGCGTG
GGAACGGCGA CGAGCCTCCT GTGCACCGCG CGTTATGTGG TGACCGACCG GTTGCACGCG
CACATTCTCT GCCTGCTTCT CGGTATCCCC CACTCGGTGT TTGACAACAG CTACGGAAAG
GTCAGTGGGA CCTTCGAGGC GTGGACCTCG GACAGCGACC TCGTGCACTG GGCGACCAGC
GCGGACGAGG CCCTGGAGAG GTGCCGGGAG TTCGCGCCGT GA
 
Protein sequence
MTEAATGVPY ASTAAVASTA RLIESLSTRA TAAVADLLPP GTTDVALLDF PYHRNSGDSA 
IWLGVRQILR QLGVRVAYVA DTARYRPDRL REALPEGPVL LLGGGNFGDL WPGHQELRVR
ALRDFPDRTV IQLPQSISFR SRTALSEAQR VTAAHPHFTL MVRERRSLSF ATENFDVPLV
FAPDSALANG PLRPPWPVRP DGVLCLARDD VEGTGAIAGV AGPGIRRADW GMRGLPAARW
RAAKALPKLE RHALARHPDA HRAARPALHA AYETMAKTSV GTATSLLCTA RYVVTDRLHA
HILCLLLGIP HSVFDNSYGK VSGTFEAWTS DSDLVHWATS ADEALERCRE FAP