Gene Francci3_0617 details


Gene Information

Locus tag: Francci3_0617
Symbol:
ID: 3903485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 700552
End bp: 702474
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 69%
IMG OID: 637877950
Product: glucosamine--fructose-6-phosphate aminotransferase
Protein accession: YP_479730
Protein GI: 86739330
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains
TIGRFAM ID: [TIGR01135] glucosamine--fructose-6-phosphate aminotransferase (isomerizing)
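
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(700552, 702474)
print(length, protein_length_aa(length))
```

Under these assumptions, the computed values agree with the Gene Length (1923 bp) and Protein Length (640 aa) fields.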


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.577604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCGGGA TCATCGGTTA CGTCGGCGAC CAGGCGGCCC TGGATGTCGC GTTGAACGGT 
CTGCGGCGGT TGGAGTACCG CGGTTACGAC TCGTCGGGTG TCGCCGTGGT CGGCGCCGGG
GCACTGCGGA CGGCGCGTCG TGCCGGCAAG CTGTCCAACC TCGAGAAGCT GCTCGCCGAG
AATCCGGATT CCAGGCCGAT GCCCGGCACC ACAGGCATGG GGCATACCCG GTGGGCCACG
CATGGTGGAC CAACCGACGC GAATGCCCAT CCGCACACCG ACTGCACCGG CGCCATCGCC
GTCATCCACA ACGGGATCAT CGAGAACTTC GCCGCGTTGC GGGTCGAGCT GGAGACGGTC
GGCCACGAGC TCGCCAGTGA GACCGACACG GAGGTCGTCG CCCATCTCCT CGAAGTCGAG
CTTGCCGGCA CCGGCACCGG CACCGGCACC GGTGCTGGTG CTGGTGCTGG TGCCGACCGC
GGGTACGACG GGGGCGCGCA TCCGCTGACC GTCGCGCTGC GACGGGTCTG CCGGCGGCTG
GAAGGCGCGT TCACCCTGGT CGTGCTGCAC CGCGACTTCC CGGAGGTGGT TGTCGGGGCC
CGGCGTAACA GCCCGCTGGT CGTGGGGCTC GGTCAGGGGG AGACCTTCCT CGCCAGCGAC
GTCTCGGCCT TCATCGCCCA CACCCGGGAG GCGCTGGAGA TCGGGCAGGA TCAGGTGGTC
GAGGCCCGCC GGGACGGTGC GACGGTGACC GACTTCGGCG GCCAGATCAT CGAGGGTCGG
CGCTACCACG TCGACTGGGA TGCCAGCGCC GCCGAGAAGG GCGGCTACCC GTACTTCATG
CTCAAGGAGA TCAGCGAGCA GCCGACCGCC CTGGCAGACA CGCTGCGCGG TCGGCTTTCC
GCCGACGGTG CGATCGTGCT CGACGAGGAG CGGCTGTCTG GCCAGGATTT CCGGGACGTG
GACAAGGTCT TCATCGTCGC CTGCGGCACC GCCTACCACG CCGGCCTCAT CGCGAAGTAC
GCAATCGAGC ACTGGACCAG GCTGCCCTGC GAGGTCGAGA TGGCTTCGGA GTTTCGCTAC
CGCGACCCGG TGCTCGACCG GTCCACCCTC GTCATCGCGA TCAGCCAGAG CGGCGAGACG
CTGGACACGC TGATGGCGGT CAAACACGCG CGCGAGCAGA AGGCGCGGGT GCTGGCCATC
TGCAACACCA ACGGTTCTAC GATCCCTCGG GAGTCCGATG CGGTGCTGTA CACCCGCTGC
GGCCCGGAGG TGGGGGTCGC GTCCACCAAG ACCTTCCTCG GGCAGGTCGC CGCGTGCCTT
CTCGTCGGCC TGTTCTTGGC CCAGGTGCGC GGCGTGCTCT ACGGCGACGA GGTCGCCGCG
TACGTCGAGC GGCTCCAGCG CATGCCCGAG CTGATCGAAC GGACGCTGGG AACGGTCGAG
CCGGTGCGGG AGCTCGCCCG CGCCCTGGCC GACGCGAAGG CGGTGCTATT CCTCGGTCGT
CATGTGGGGT ACCCGGTGGC CCTGGAAGGG GCCCTCAAGC TCAAGGAACT CGCCTACATG
CACGCCGAGG GGTTCCCGGC GGGGGAGCTC AAGCACGGGC CGATCGCGCT CATCGAGCCC
GGCCTGCCGG TGTTCGTCGT TGTGCCGTCC CCGCGGGGTC GTGCCAACCT GCACGGAAAG
ATCGTTTCCA ACATCCAGGA GGTCCGGGCG CGTGGCGCCC GCACGATCGT GATCGCGGAA
GAGAGCGACA CCGCCGTCGA ACCGTTCGCC GATCATCTCA TCCGCGTGCC CGCGACCGCG
TCGCTGTTCG CGCCCCTGGT CACGACGCTA CCGCTGCAGG TCTTCGCGTG CGAGCTGGCG
CTTGCGCGCG GCCTCGATGT CGATCAGCCG CGTAACCTGG CGAAGTCGGT GACAGTCGAG
TAA
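
One way to recompute a GC content figure from a sequence in the blocked layout above is sketched below. The function name `gc_percent` is hypothetical, and how the database itself rounds the reported value is not stated in the record, so this is only an approximation of that calculation.

```python
def gc_percent(sequence):
    """Percentage of G and C bases in a DNA sequence.

    Whitespace (the spaces and line breaks of the 10-base block
    layout) is ignored; the input is treated case-insensitively.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("ATGTGCGGGA TCATCGGTTA"))
```

Passing the full gene sequence as a single string should give a figure comparable to the GC content field in the Gene Information section.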
 
Protein sequence
MCGIIGYVGD QAALDVALNG LRRLEYRGYD SSGVAVVGAG ALRTARRAGK LSNLEKLLAE 
NPDSRPMPGT TGMGHTRWAT HGGPTDANAH PHTDCTGAIA VIHNGIIENF AALRVELETV
GHELASETDT EVVAHLLEVE LAGTGTGTGT GAGAGAGADR GYDGGAHPLT VALRRVCRRL
EGAFTLVVLH RDFPEVVVGA RRNSPLVVGL GQGETFLASD VSAFIAHTRE ALEIGQDQVV
EARRDGATVT DFGGQIIEGR RYHVDWDASA AEKGGYPYFM LKEISEQPTA LADTLRGRLS
ADGAIVLDEE RLSGQDFRDV DKVFIVACGT AYHAGLIAKY AIEHWTRLPC EVEMASEFRY
RDPVLDRSTL VIAISQSGET LDTLMAVKHA REQKARVLAI CNTNGSTIPR ESDAVLYTRC
GPEVGVASTK TFLGQVAACL LVGLFLAQVR GVLYGDEVAA YVERLQRMPE LIERTLGTVE
PVRELARALA DAKAVLFLGR HVGYPVALEG ALKLKELAYM HAEGFPAGEL KHGPIALIEP
GLPVFVVVPS PRGRANLHGK IVSNIQEVRA RGARTIVIAE ESDTAVEPFA DHLIRVPATA
SLFAPLVTTL PLQVFACELA LARGLDVDQP RNLAKSVTVE
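
The protein sequence appears to be the conceptual translation of the gene sequence under translation table 11. A minimal sketch of that translation follows; the names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may start translation. Alternative start codons are not handled here, which is harmless for this gene because it begins with ATG.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace from the blocked layout is ignored. Codons containing
    characters other than A, C, G, T raise KeyError.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTGCGGGA TCATCGGTTA CGTCGGCGAC"))  # MCGIIGYVGD
```

The printed residues match the first block of the protein sequence (MCGIIGYVGD).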