Gene Francci3_4457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4457 
Symbol 
ID3907433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5329181 
End bp5330308 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content76% 
IMG OID637881789 
Productglycerate kinase 
Protein accessionYP_483532 
Protein GI86743132 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1929] Glycerate kinase 
TIGRFAM ID[TIGR00045] glycerate kinase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGGG TACCGTCCCC CCACCGGCCC GCGCGGGTGG TCCTCGCACC CGACTCGTTC 
AAGGGATCGG CGACCGCGGC CGAGGTGGTC GCGGCGCTGG GCGCGGGCTG GCGCTCCGAG
CGGCCCCGCG ATCAGGTCGT GGGAGTACCG ATCGCCGACG GTGGCGAGGG CACCCTCGAC
GTCTTCGCCG TCGGCGTACC CGGGTCGGAG CGGCACCCGG CACGGGTGAC TGGTCCGGAC
GGGCGGGGCC ACGACGCCGA ATGGCTGTCG CTGCCCGATG GCACCGCAGT GATCGAACTG
GCGAGGGCGA GCGGGTTGCC GCTGATGCGG GAACTCGATC CGTTGGGCGC GCAGACCGTG
GGCCTCGGCG AGTTGGTCGC CGCCGCGATC GACGCCGGAG TCGACCGCGT TCTCATCACT
TTGGGCGGTT CCGCCTCGAC CGACGGTGGT ACCGGTGCCC TCGCCGCGTT GGGTGCCCGG
TTCCTCGACG CGGCCGGCCG GCCGTTGGCC GTCGGCGGCG GCGCCCTCAC GTCGCTCGCG
CACATCGATC TGACCGGGTT GCGGCCGGCG CCGGCCGGGG GAGCGTACTG TCTCGTCGAC
GTCGACGCGC CCCTGCTGGG CCCTGCCGGC GCCGCCGCCG TATTCGGGCC GCAGAAGGGT
GCCGGACCCG CGGACATCAC CCGGCTGGAG GAGGGGCTGC GCAGGCTCGC CCACCTGTTG
GGTGGTGCTC CCGACGCCGC GGGAGCCGGG GCTGCCGGCG GCACCGCCTA CGGCCTCGCC
GCGGCCTGGG GAGCCGAGGT GGTCCCCGGC CTGCCGACGA TCATCCAGGC GGCGGGCCTG
CCGGCGGCCC TTGCCGGCGC CGACTGGGTG GTGACCGGCG AGGGCAGGTT CGACCGCACC
TCGTTGTCCG GCAAGGTGGT CGGCGGAGTC CTCGCGCTGG CGCGGGACGC CGAGGTGCCC
GTGCTCCTGG TGGCTGGGCG GGTTGACGCC CCCCGACCGG AGGGGGTCCG GGCCGAGATC
GCCCTCGTCG ATCTCGCCGG GGGGCCGGCC ATGGCGATGG CCGAGCCGCT CCGGTGGCTG
CGGAGGGCCG GCGCCGAGCT GGCCCGCCGG GTCACGACCG CCCGGTGA
 
Protein sequence
MTGVPSPHRP ARVVLAPDSF KGSATAAEVV AALGAGWRSE RPRDQVVGVP IADGGEGTLD 
VFAVGVPGSE RHPARVTGPD GRGHDAEWLS LPDGTAVIEL ARASGLPLMR ELDPLGAQTV
GLGELVAAAI DAGVDRVLIT LGGSASTDGG TGALAALGAR FLDAAGRPLA VGGGALTSLA
HIDLTGLRPA PAGGAYCLVD VDAPLLGPAG AAAVFGPQKG AGPADITRLE EGLRRLAHLL
GGAPDAAGAG AAGGTAYGLA AAWGAEVVPG LPTIIQAAGL PAALAGADWV VTGEGRFDRT
SLSGKVVGGV LALARDAEVP VLLVAGRVDA PRPEGVRAEI ALVDLAGGPA MAMAEPLRWL
RRAGAELARR VTTAR