Gene Francci3_0095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0095 
Symbol 
ID3902928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp117015 
End bp118124 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content72% 
IMG OID637877425 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_479218 
Protein GI86738818 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.605031 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGTG GATTGAACTC GTACCGGGCG GCCGGGGTCA ACGTGGCCGC GGGCGAGCGT 
GCCGTCGAGC TGATGCGCGG TCACGTCGCG CGCGCGATCC GGCCCGAGGT CGTCGGTTCG
CTCGGCGGAT TCGCCGGCCT GTTCGCGCTG GATACGGCGC GGTACCGTCG GCCGCTGCTC
GCCTCGTCGA CCGACGGTGT GGGCACCAAG ATTGCCGTTG CCCGGGCCCT GGACACGCAC
GACACCGTGG GCATTGATCT GGTCGCCATG GTTGTCGACG ACCTCGTGGT GTGCGGCGCG
GAGCCGCTGT TCCTGCTCGA CTACATCGCG TGCGGCTCCC TGGTCCCCGC GCGCGTGGCC
GAGATCGTCT CCGGCATCGC AACCGGCTGC GAGCAGGCGG GGGCGGCGCT GGTCGGCGGG
GAGACCGCCG AGCACCCCGG GCTCATGGGT AGCGACGACT ACGACCTGGC CGCGACGGGG
GTCGGGGTCG TGGAGGCCGA CGACGTGCTC GGGCCGGAGC GAGTCCGGCC CGGGGACGTG
GTGGTCGCGA TGGCATCATC CGGCATCCAC TCCAACGGCT TCTCGCTCGT ACGGCATATC
TTGTTCGGTC CTGTCGATTC TGGCCAGCCC GGTGGGATTC CCGAGACCGC GCGGGAGGAT
CTGGAGGCAT ACGTCCCGTC CCTGGGGGGC ACGCTGGGCA CGTCCCTGCT GGTTCCGACC
CGCATCTATG CTCGGGACTG CCTGGCGCTG GCCGCGGCTG TCGAGGTGCA CACCTTCGCC
CACATCACCG GCGGCGGTCT CGCGGCGAAC CTCGCCCGGG TCATCCCGCC GGGCCTGCTG
GCCACGGTGG ACCGGGCGTC GTGGTCAGTG CCCCCGATCT TCGGTCTGCT CGCCGAGCGC
GGCGAGGTGA CCCAGGCGGA CATGGAAGCC ACCTTCAACC AGGGAGTCGG CATGGTGGCG
GTCTTGCCGG CCACCGCGGT CGCCGACGCT CTCGCGCTGC TCGCCGCGCG GGACGTGCCG
GCCTGGGTGG CAGGGGAGGT CGGCACGGCG GACGCTCCGG AGCCAGCCGG CGTGGCCAGG
GCCCGGCTCG CCGGCCGGCA TCCACGCTGA
 
Protein sequence
MSGGLNSYRA AGVNVAAGER AVELMRGHVA RAIRPEVVGS LGGFAGLFAL DTARYRRPLL 
ASSTDGVGTK IAVARALDTH DTVGIDLVAM VVDDLVVCGA EPLFLLDYIA CGSLVPARVA
EIVSGIATGC EQAGAALVGG ETAEHPGLMG SDDYDLAATG VGVVEADDVL GPERVRPGDV
VVAMASSGIH SNGFSLVRHI LFGPVDSGQP GGIPETARED LEAYVPSLGG TLGTSLLVPT
RIYARDCLAL AAAVEVHTFA HITGGGLAAN LARVIPPGLL ATVDRASWSV PPIFGLLAER
GEVTQADMEA TFNQGVGMVA VLPATAVADA LALLAARDVP AWVAGEVGTA DAPEPAGVAR
ARLAGRHPR