Gene Francci3_4022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4022 
Symbol 
ID3906983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4808490 
End bp4809509 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content64% 
IMG OID637881351 
Productglyceraldehyde-3-phosphate dehydrogenase 
Protein accessionYP_483101 
Protein GI86742701 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATCA TGACAATCAC TGTGGGAATC AATGGCTTCG GCCGTATCGG CCGTAGCTAC 
TTCCGTGCGC TGCTCACTTC CGGTGCCGAC ATCCGCGTGG CCGCGGTCAA CGACCTGACC
AGCGCGCAGA GTCTGGCGGA CCTGCTGAAG TACGACAGCG TGTACGGGCC GCTGCCGCAG
CAGGTCGCGG CGGAGGGCTC ATCGCTCAGG GTCGGGGACA CCGTGGTCGA GGTTCTCAGC
GAACGCGACC CGGCACAGCT GCCGTGGCGC CGCCTCGGAG TCGACGTTGT CATCGAGTCG
ACGGGCGTGT TCAACGACGC GGCCAAGGCC CGGGCGCATA TTGATGCCGG CGCCTCGAAG
GTGGTCGTCT CTGCTGCGGC AAAGAACGCG GATCTCACCC TCGTCATCGG TATCAACGAT
GACCTGTACG ACCCCCAGAA GCACACGGTC GTCTCCAACG CTTCGTGCAC GACGAACTGC
CTGGCTCCCA TGGCCCGGGT GCTCGATGAC GGGCTCGGCA TCGAGTGCGG CACCATGACC
ACGATCCACG CCTACACGCA GGATCAGAAC CTGCAGGACG GCCCGCACCC CGACCCCCGG
CGCGCCCGTG CGGCCAACCT CAGCACCATC CCGACCACCA CCAATGCCGC CAGTGCGATC
GGCCTCGTGC TCCCAAACCT GAAGGGCAAG CTCGACGGGT ACTCGGTGCG GGTTCCTGTT
CCCGTTGGCT CGCTGACCGA CCTGACCGTC CGGGTGGACC GTGAGACGAC GGTGGAGGAG
GTCAACTCGC TTTTCCGCAA GGCGGCGGAC GGTGAACTTG CACGAATCCT GCGCTACACT
GCAGACCCGG TCGTTTCCGC GGATATCGTC AAGGATCCGG CGTCATGCAT CTTCGACTCC
CTGCTCACGC AGGTTATCGA GGGGCGCCAC GTACACATCT TCGGCTGGTA CGACAACGAG
TGGGGATTCT CCAACCGCCT TATAGACACA ACCCAGTTGG TCGGCGGCGC GACTGCATGA
 
Protein sequence
MGIMTITVGI NGFGRIGRSY FRALLTSGAD IRVAAVNDLT SAQSLADLLK YDSVYGPLPQ 
QVAAEGSSLR VGDTVVEVLS ERDPAQLPWR RLGVDVVIES TGVFNDAAKA RAHIDAGASK
VVVSAAAKNA DLTLVIGIND DLYDPQKHTV VSNASCTTNC LAPMARVLDD GLGIECGTMT
TIHAYTQDQN LQDGPHPDPR RARAANLSTI PTTTNAASAI GLVLPNLKGK LDGYSVRVPV
PVGSLTDLTV RVDRETTVEE VNSLFRKAAD GELARILRYT ADPVVSADIV KDPASCIFDS
LLTQVIEGRH VHIFGWYDNE WGFSNRLIDT TQLVGGATA