Gene Francci3_3186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3186 
Symbol 
ID3903911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3775035 
End bp3776555 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content73% 
IMG OID637880510 
Productbifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II protein 
Protein accessionYP_482272 
Protein GI86741872 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000678178 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.119885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAT CCGCGGCCGG CGCCTCCTCG ACGGCCACCC TTCCCGCCGG TGCCCTTCCC 
GCCGGCTACG CCGACGTGCG GTTCGCCGCC ATCGAGGACG CGCTCGCCGA GATCGCCGCC
GGCCGCCCGC TCGTCGTCGT CGACGACGCC GACCGGGAGA ACGAGGGTGA TCTGATCTTC
GCGGCGGAGG CCGCGACACC CGAGCTGGTC GCGTTCATGA CGCGGTACAC CTCCGGTGTG
ATCTGCGTGC CGATGGACGC CGCCGACACC GACCGGCTCG AACTGGACCA GATGGTCCCG
CACAACACGG AGCGGATGGG CACCGCGTTC ACCATCACCG TGGACGCCCG CGACGGCGTC
TCCACCGGGA TCTCCGCCGC CGACCGGGCC CGCACCATCC GGTTGCTGGC CGACCCGCGG
ACCACGCCCG GCGACCTGAG CCGGCCGGGC CACATCTTCC CGCTGCGGGC CAAGGACGGC
GGGGTGCTGC GCCGCCCCGG CCACACCGAG GCCGCGGTCG ACCTGGCCCG GCTCGCCGGG
CTGCGGCCGG CCGGTGCGAT CTGCGAGATC GTGAACGACG ACGGGACGAT GGCCCGGCTT
CCCCGGCTCG CCGAGTTCGC CCGCGAGCAC GGGCTGCTGC TGATCTCCAT CGCCGATCTC
ATCGCTTACC GTCGGCGGAC CGAGAAGCAG GTGGTGCGGG TCGCGGAGGC GTCGCTTCCG
ACGCGTTACG GGGCGTTCCG GGCGGTCGGG TTCCGCAACA TCCTCGACGG CGTCGAACAC
ATCGCGCTGA TCCGCGGCGA GCTCGGTGAC GGCGCGGACG TGCTGGTCCG GGTGCACAGC
GAATGCCTCA CCGGGGACGT CTTCGGCTCC CGGCGCTGCG ACTGCGGAAC CCAGCTCGAC
GCGGCACTGC GCATAATCGC GACCGAGGGA CGCGGAGTGG TGCTCTACAT GCGGGGCCAC
GAGGGCCGGG GCATCGGCCT GATGCACAAG CTGCGGGCCT ACCAGCTGCA GGACGCCGGC
CACGACACCG TCGACGCCAA CCTCGCGCTG GGTCTGCCGG CCGACGCCCG TGACTACGGC
ACCGGGGCCC AGATCCTGGT CGACCTCGGC GTGCGCGGCA TCCGGCTGCT GTCCAACAAT
CCGACGAAGC GGGCCGGGCT GGAGGGCTAC GGCCTGCGCA TCGTCGAACT CATCGGGATG
CCGGTCACCC AGACGCCGGA GAACCTGCGC TACCTGACCA CCAAGCGGGA CCGGATGGGA
CACGTCATTC CCGGTCTGCC CAGCATTCCC GGTCTGCCCA GCATTCCCGG TCCGCCCGGC
CTTCCGGGTC CGCCCGGCCT TCCGGGTCCA CCCGGCCTGT CCGATCTGCC TAGGGCCGTG
GGCGAGGACG AGCCGGCCGG CTCGCGCTGG ATCCCGGGGG CGCCGAGGGC GCCGAGGGTA
CCCGTCACCT CCGGCTGCAC GGCCGTCGCG GGCCCGGGTA CGTCCGCCGG GGTCGACTGT
GCCGAGGGGG AGGGTCTGTG A
 
Protein sequence
MTTSAAGASS TATLPAGALP AGYADVRFAA IEDALAEIAA GRPLVVVDDA DRENEGDLIF 
AAEAATPELV AFMTRYTSGV ICVPMDAADT DRLELDQMVP HNTERMGTAF TITVDARDGV
STGISAADRA RTIRLLADPR TTPGDLSRPG HIFPLRAKDG GVLRRPGHTE AAVDLARLAG
LRPAGAICEI VNDDGTMARL PRLAEFAREH GLLLISIADL IAYRRRTEKQ VVRVAEASLP
TRYGAFRAVG FRNILDGVEH IALIRGELGD GADVLVRVHS ECLTGDVFGS RRCDCGTQLD
AALRIIATEG RGVVLYMRGH EGRGIGLMHK LRAYQLQDAG HDTVDANLAL GLPADARDYG
TGAQILVDLG VRGIRLLSNN PTKRAGLEGY GLRIVELIGM PVTQTPENLR YLTTKRDRMG
HVIPGLPSIP GLPSIPGPPG LPGPPGLPGP PGLSDLPRAV GEDEPAGSRW IPGAPRAPRV
PVTSGCTAVA GPGTSAGVDC AEGEGL