Gene Francci3_3490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3490 
Symbol 
ID3905224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4160937 
End bp4162076 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content77% 
IMG OID637880812 
Productmolybdopterin-guanine dinucleotide biosynthesis protein A-like 
Protein accessionYP_482572 
Protein GI86742172 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0746] Molybdopterin-guanine dinucleotide biosynthesis protein A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.5944 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGCCG GGGGTGGGGC GCGCCGGCTC GGCGGCCGGG ACAAGCCGGC GGTGATGGTC 
GGCGGTTCCA CCCTGCTCGA GCGGGTGCTG TCCGCGGTGT TGGACGCCGA ACGGGTGGTC
ATCGTCGGTC CCCGGCGCGA CCTGGCCGTC GATCCGATTC CGCCTGGTCG GGTGCGCTGG
TGTCGGGAGG ATCCGCCCGG CGGGGGACCG GTCGCGGCGA TCGCCGCCGG CCTTGTCGAG
ATCACCACAC CGTTCGTCGC CGTCCTCGCC GCCGACCTGC CGTTCCTGAC CGGCCGGGAG
ATCGCGCTGC TGCGCCGCGG GGTCGCGGAC CCCGCGGCGC AGGCCGCGCT GCTGGTCGAC
CCGGACGGCC GCCGGCAGTT TCTGGCCGCG GTGTGGCGGA CGGCCTCGCT GTGGGCGGCG
CTACCCGCCG ATCCGATCGG GCGCCCGGTG CGCGGCCTGT TCGCAGACCG TCCGGTGACC
GCGGTCCGGG CGCACGCCCG GACCTGCCTC GACTGCGACG AACCGGCGGA TGTGGCGCGG
GCCCGCAGCT GGGCCGCGGT GGGCGAGCGT GGCCCGGCCC GGCACGATAG GCCTATGACC
TCCGCCGATG ACCAGCAGCC TGATCACCAG TCGTGGTCCG ATCATCGGGC TTCGGCGGAT
CCGCCGCCCC CGGAGGCCGG ACGTCATCGC CCGCCGCCCC CGGAGGCCGG ACGTCATCGC
CCGCCGCCCG CCGCGGCCGT CGGCTCGGGT GCCGGCAACG TTCGGGACAA GAACGTTCGG
GACAAGAACC TCCTGGAGCG GGACGTCCTG GCGCAGTGGG TGTCGGACGT CTGCGCCGAG
CTCGGCCTCG ACGCGGCACG GATCGACGTG GGCGCCGTCC TCGATCTCGC GCGCGACGTC
GCCCACGGGG TCGCGCGTCC CGCGGCGCCA CTCACCGCGT TCCTGGTTGG TCTGGCCGCC
GGCCGGAACG CGGGCGGTGC CCACGGTGAG GAGGGCGGAG GCGAGGGGGG TACGGATGGT
GAGCGGGAGG CCGCGGCTGC GCGCGCGGCG ACGTCGGCCG TGCTCGGCCT CCTGGCGCGG
GCGCGGGCGG GGACCGGACC CGCTCAGCCC ATCCGGCCCG GACCCGCCTC GTCGAGGTAG
 
Protein sequence
MLAGGGARRL GGRDKPAVMV GGSTLLERVL SAVLDAERVV IVGPRRDLAV DPIPPGRVRW 
CREDPPGGGP VAAIAAGLVE ITTPFVAVLA ADLPFLTGRE IALLRRGVAD PAAQAALLVD
PDGRRQFLAA VWRTASLWAA LPADPIGRPV RGLFADRPVT AVRAHARTCL DCDEPADVAR
ARSWAAVGER GPARHDRPMT SADDQQPDHQ SWSDHRASAD PPPPEAGRHR PPPPEAGRHR
PPPAAAVGSG AGNVRDKNVR DKNLLERDVL AQWVSDVCAE LGLDAARIDV GAVLDLARDV
AHGVARPAAP LTAFLVGLAA GRNAGGAHGE EGGGEGGTDG EREAAAARAA TSAVLGLLAR
ARAGTGPAQP IRPGPASSR