Gene Francci3_4244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4244 
Symbol 
ID3907210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5065287 
End bp5066936 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content69% 
IMG OID637881570 
Productformylmethionine deformylase 
Protein accessionYP_483319 
Protein GI86742919 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0242] N-formylmethionyl-tRNA deformylase 
TIGRFAM ID[TIGR00079] peptide deformylase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.928565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTACG AATCTGGACC TTACCGGGTA TCGCCCAGGC TGCTGGCCCG GGAGGACTTC 
CGGGCGGCGT GCGCGACGCG GGACTTCCGC ACGGTGTTCC GGCTGATGAA GAAGTACGAC
GGTGCCAGCC AGAATCGCAT CGCATCTCCG GTGGAAGGGC TCAGCCAGAG CAGGGTCAGC
CGGATCATGC AGGGCGAGGA CCACATCGCC CGGCTCTCCC TGATCGAACG CATCACCGAC
GCGCTGCGCA TCCCGGGCAG TTATTTCCAC CTGGCCGCAC GGCCGTGGGA GGCGGCACGG
GACGTGCGCG AGCCGCCCGG GCCGCCAACG GTTCCGGCCG GATCCGGCCC GACGCCGGTA
GCCCACGTCG CGACCCCCTC CGTCGACCCG GCCCCGGCCC CCGGCCCCGT CGTCGCCGAG
AGCCCGGGCC GGCTCCCCCC CGGACCGGGG GGCCTGGTCG TCGAGGAGGA CCGGGCCGAA
CTCCGCTACG ACGACAGCAT GTACGTCGCC CACCAGCGAA GACGTCTTCT CAACGCCGGC
AGCGAACCGG TAACCCGGTA TCTGATGCGG ATATCGGTCG ACCGGTTCCC GGGAGATCCG
GAACGCTCCG CGCGGTTGTA CCGATCCGAT CCGCTGCGCT GGGAGGACCT CCAGCTCAGC
GCGCACTGTG GCGGCGAGCA GATGGCCTGG CAGGTCAAGA CCGACTGGGA CGCGCTCAAG
GAGGTCTGGC TGCTCTTCGA GAACAGCCGA GGACGATTTC CGCTCTATCC CGGCGAATCG
GCGTGGATCA ACTACAGCTA CCAGGTAAGC AGGGAGAAGT GGGGGCCGTG GTTCCAGCGT
GCCGTGCGGG TACCGACGAG TTTCCTCTCG GTGCGGCTGG ACTTTCCGGC CGAGCTGGAT
CCCGTGGTCT GGGGCATGGA GACGTCGATG ACCGCCGATG CCTTCCCGTT GCGGACCGCA
GTCGCCCGCA CCGACTCCGG CGACCGGCGG ATCTTCACCT GGTCGACCAC CGAGCCGCCG
CTGCACGCCC GCTACCGGCT GGAATGGAAC TTCCGGGCCA CCGTGCCGGA ACGTGATCCG
GCCACGGAGA CCGGAACACG TCCCAGCGAT CAGATGGCGG CCGTCGGCAT CGTGCAGGAG
GGTGAGGCGA TCCTGCGGCA GCCGGCCCGC CCGTTCGCTC TGCCCAACGA GGCCGAAGAC
GCGCGCCGGG TCGTCGCCGA ACTGTCCTCC GCCCTCGAAC GGGTGTCCGC CCTGCACACC
TTCGGCAAGG GCCTGGGCAT CGCCGCGCCG CAGGTCGGCA TCAACCGCGC CGCGGCGATC
GTGCGTACCG CGGGCGGCGA CACGCTCACG CTGCTCAACC CCAGCGTCAT CGAGACCTCC
CGAGAGACCG ACGAGCAGTA CGAGGGCTGC CTGAGCTTCT TCGACGTCCG TGGCCTCGTC
CCCCGGCCGC TGGAGCTGCA CGTCGAACAC ACCGATATCG ATGGCAACCG GCACATCACC
GTCTATCGGC AGGGGCTGGC GCGGCTCGTC GCTCACGAGA TCGATCACCT GCACGGTCAG
CTCTACACGG ATCGGATGCG GGCCGGAACC ACACCGATCC CGGTGGAGCA GTACCGGGGT
ACCGGCTCGG CCTGGTCCTA CAACCGCTGA
 
Protein sequence
MSYESGPYRV SPRLLAREDF RAACATRDFR TVFRLMKKYD GASQNRIASP VEGLSQSRVS 
RIMQGEDHIA RLSLIERITD ALRIPGSYFH LAARPWEAAR DVREPPGPPT VPAGSGPTPV
AHVATPSVDP APAPGPVVAE SPGRLPPGPG GLVVEEDRAE LRYDDSMYVA HQRRRLLNAG
SEPVTRYLMR ISVDRFPGDP ERSARLYRSD PLRWEDLQLS AHCGGEQMAW QVKTDWDALK
EVWLLFENSR GRFPLYPGES AWINYSYQVS REKWGPWFQR AVRVPTSFLS VRLDFPAELD
PVVWGMETSM TADAFPLRTA VARTDSGDRR IFTWSTTEPP LHARYRLEWN FRATVPERDP
ATETGTRPSD QMAAVGIVQE GEAILRQPAR PFALPNEAED ARRVVAELSS ALERVSALHT
FGKGLGIAAP QVGINRAAAI VRTAGGDTLT LLNPSVIETS RETDEQYEGC LSFFDVRGLV
PRPLELHVEH TDIDGNRHIT VYRQGLARLV AHEIDHLHGQ LYTDRMRAGT TPIPVEQYRG
TGSAWSYNR