Gene Francci3_0094 details


Gene Information

Locus tag: Francci3_0094
Symbol:
ID: 3905138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 115384
End bp: 117018
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 71%
IMG OID: 637877424
Product: amidophosphoribosyltransferase
Protein accession: YP_479217
Protein GI: 86738817
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase
TIGRFAM ID: [TIGR01134] amidophosphoribosyltransferase
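
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(115384, 117018)
print(length, protein_length(length))
```

Under those assumptions the sketch gives 1635 bp and 544 aa, which agrees with the Gene Length and Protein Length fields.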


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.971406
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCAGACG TCGCACCCCG CTTCGGCGCC ACCCCCGGAT GGGAATCAGA CCTCTCCGAC 
GAGCCGGGGC CGCGTGACGC CTGCGGTGTG TTCGGGGTGT GGGCGCCCGG TGAGGACGTG
GCGAACCTGG CCTACTACGG TCTGTACGCG CTGCAGCACC GCGGGCAGGA GGCTGCCGGC
ATCGCGGTCG GGGACGGTCG GACCGTGGTG GTCTTCAAGG AACTGGGGCT TGTCGCTCAG
GTCTTCGACG AGGTCACCCT GTCCAGCCTG AGCGGCCATG TGGCGGTCGG TCACACCCGG
TACTCGACGA CGGGCTCCTC CACCTGGGAG AACGCCCAGC CGTCGTACCG CACCGCCCGC
TTCGGCGGCC CCATCGCGCT CGGGCACAAC GGCAACCTCA CCAACATCGT CGAGCTCGCG
CGGACGCTCG GCGCGGAGCG GGACCGCCTG CGGGCGACCA CCGACTCGGA CCTCATCACC
GCCATGCTCG CCGACCATCC CGGCCCGACC CTCGTCGACG CGGCGATGGA CGTGCTGCCC
CGCCTGACCG GGGCCTTCTC GCTGGTCTTC TCGGACGCCT CGACGCTGTA TGCCGCCCGG
GACGCGCACG GGATCCACCC GCTCGTGCTC GGCCGGCTCG ACGGCCATCC CGACGGCGCG
TGGATCATCG CCAGCGAGAC GGCCGCCCTC GACATCGTCG GCGCCACCCT CGTACGGGAG
ATCGTTCCCG GCGAGCTGGT CGTCATCGAC GCCGAGGGGG TGCGTTCGCG CACGTTCGCC
CGGCCCGATC CGCACGGCTG CCTGTTCGAG TACGTCTATC TCGCCCGCCC GGACACGTCG
ATCGCCGGAC GTTCCGTGCA CGCCACCCGG GTCGATGTTG GCCGGCAGCT GGCCCGCGAG
GCGCCGGTGG AGGCCGACCT GGTCATCCCC GTGCCGCAGT CCGGGGTACC CGCCGCGGTC
GGCTACGCGG AGGAGTCGGG TATCCCGTTC GGTGAGGGCC TGGTCAAGAA CTCCTACGTC
GGGCGCACCT TCATCCAGCC CTCGCAGACG ATTCGGCAGC GCGGGATCCG GCTCAAGCTG
AACCCCCTGC GGGATGTGAT CGAGGGGCGT CGCCTCGTCG TGGTGGACGA CTCGATCGTC
CGCGGCAACA CGCAGCGCGC GCTGGTGCGC ATGCTGCGGG AGGCCGGCGC CGCCGAGGTC
CACATCCGGA TCTCCTCGCC GCCGGTACGG TGGCCCTGCT TCTACGGCAT CGACTTCGCG
ACCCGCGCCG AGCTGATCGC CAGTGAGGCG GGCATCGAGG AGATCAGAGC CTCTCTCGGG
GCCGACTCGC TGGCCTACGT CTCGTTGGAG GGCCTCGTCG AGGCCTCCCG CCAACCGGCC
GGCACGCTCT GCCGGGCCTG CTTCGACGGC GTGTACCCGG TTCCGCTGAA CGATTCCGAC
AAGCTCGGCA AGCATCGGCT GGAGGCGACG GGCGGCGCGG AGACCACCGA GGACATGATC
GCCGAGGCGT TGCGGCGCGG CGTGACGCTG GGCATGAACG GTGCCGATCC CTCCCCGGAC
AACCACGACG CCGAGCCTGA GCCCGAGCCC GATCTTGAGT CCGAGCCCGA GCCGGCCGGG
GTGAGGCCCG CATGA
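
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before computing anything from it. The sketch below is one possible way to do that in Python and to compute a GC fraction. The helper names `normalize` and `gc_fraction` are hypothetical, and only the first and last displayed lines are used here for brevity.

```python
def normalize(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last blocked lines of the gene sequence shown above.
first_line = "GTGCCAGACG TCGCACCCCG CTTCGGCGCC ACCCCCGGAT GGGAATCAGA CCTCTCCGAC"
last_line = "GTGAGGCCCG CATGA"
fragment = normalize(first_line + "\n" + last_line)
print(len(fragment), round(gc_fraction(fragment), 2))
```

Applied to all lines of the gene sequence, the same two helpers would give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.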
 
Protein sequence
MPDVAPRFGA TPGWESDLSD EPGPRDACGV FGVWAPGEDV ANLAYYGLYA LQHRGQEAAG 
IAVGDGRTVV VFKELGLVAQ VFDEVTLSSL SGHVAVGHTR YSTTGSSTWE NAQPSYRTAR
FGGPIALGHN GNLTNIVELA RTLGAERDRL RATTDSDLIT AMLADHPGPT LVDAAMDVLP
RLTGAFSLVF SDASTLYAAR DAHGIHPLVL GRLDGHPDGA WIIASETAAL DIVGATLVRE
IVPGELVVID AEGVRSRTFA RPDPHGCLFE YVYLARPDTS IAGRSVHATR VDVGRQLARE
APVEADLVIP VPQSGVPAAV GYAEESGIPF GEGLVKNSYV GRTFIQPSQT IRQRGIRLKL
NPLRDVIEGR RLVVVDDSIV RGNTQRALVR MLREAGAAEV HIRISSPPVR WPCFYGIDFA
TRAELIASEA GIEEIRASLG ADSLAYVSLE GLVEASRQPA GTLCRACFDG VYPVPLNDSD
KLGKHRLEAT GGAETTEDMI AEALRRGVTL GMNGADPSPD NHDAEPEPEP DLESEPEPAG
VRPA
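
The gene sequence begins with GTG, yet the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the permitted initiation codons and is read as methionine at the start of a coding sequence. The following Python sketch illustrates the idea under stated assumptions: the codon-to-amino-acid string and the set of start codons follow NCBI's published listing for table 11 as I understand it, and `translate_cds` is a hypothetical helper, not the tool used to produce this record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(blocked: str) -> str:
    """Translate a coding sequence, reading a valid first codon as M."""
    seq = "".join(blocked.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codons still initiate with Met
            continue
        amino_acid = CODON_TABLE[codon]  # KeyError on ambiguous bases such as N
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
print(translate_cds("GTGCCAGACG TCGCACCCCG CTTCGGCGCC"))
```

For these first thirty bases the sketch returns MPDVAPRFGA, matching the first block of the protein sequence.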