Gene Francci3_3023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3023 
Symbol 
ID3904376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3589046 
End bp3589804 
Gene Length759 bp 
Protein Length252 aa 
Translation table11 
GC content73% 
IMG OID637880343 
Productphosphoribosyl isomerase A 
Protein accessionYP_482109 
Protein GI86741709 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase 
TIGRFAM ID[TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase
[TIGR01919] 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase/N-(5'phosphoribosyl)anthranilate isomerase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.139685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGCTTA CTCTGCTGCC CGCCGTGGAC GTGGCCGACG GCAGGGCCGT CCGACTCGTC 
CAGGGCGAGG CCGGTTCCGA GACCTCGTAC GGCGACCCGC GGGAGGCGGC GCTGACCTGG
CAGCGCGACG GCGCCGAGTG GATCCACCTG GTCGATCTCG ACGCTGCCTT CGGCCGGGGG
TCGAATCGGG AGCTCATCGC CGAGGTGGTA CGCGCGGTGG ACGTGGCCGT CGAGCTCTCC
GGCGGCATCC GCGACGACGC ATCGCTCGAC GCGGCGCTGG CCACCGGCGC GGCCCGGGTC
AACATCGGCA CGGCTGCGCT CGAGGATCCC GACTGGGTCC GCCGGGCCAT CGACCGGGTC
GGTGACCGTA TCGCGGTCGG TCTCGACGTC CGGGGGACCA CGCTGTCGGC CCGGGGCTGG
ACGCGGGACG GTGGCGAGCT GTTCGATGTG CTCGCCCGCC TCGACGCCGA CGGCTGCGCC
CGGTACGTGG TGACGGATGT GCGCCGAGAC GGCACGCTCA CCGGGCCGAA CGTCGAGCTC
CTGCGCTCCG TGACCGCGGC CACCAGCCGG CCGGTGGTCG CCAGCGGCGG CGTGGCCACG
CTCGACGACC TCACCGCGAT CGCCGTGGTG CCCGGAGTGG AGGGCGCGAT CATCGGCAAG
GCGCTCTACG CCGGCGCCTT CACGCTGCCC GAGGCCCTGG CCGTTGCCGG GAATATCGGG
AATATCGGAA ACGGATGTGC GGGTGCGGTG GGTCGATGA
 
Protein sequence
MTLTLLPAVD VADGRAVRLV QGEAGSETSY GDPREAALTW QRDGAEWIHL VDLDAAFGRG 
SNRELIAEVV RAVDVAVELS GGIRDDASLD AALATGAARV NIGTAALEDP DWVRRAIDRV
GDRIAVGLDV RGTTLSARGW TRDGGELFDV LARLDADGCA RYVVTDVRRD GTLTGPNVEL
LRSVTAATSR PVVASGGVAT LDDLTAIAVV PGVEGAIIGK ALYAGAFTLP EALAVAGNIG
NIGNGCAGAV GR