Gene Francci3_2839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2839 
Symbol 
ID3904751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3343446 
End bp3344696 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content72% 
IMG OID637880160 
ProductMcrBC 5-methylcytosine restriction system component-like 
Protein accessionYP_481926 
Protein GI86741526 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.117814 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGCGC CGGTGGAGCT GACCGAGGGC GCCGGGTGGC AGCGGCGGAA GCTGAGCCCG 
GGCCAGGCCG ATGCGCTCGA TGCCAGCGAG GTGGCGCAGG TGCGGCAACG GCGTGCCGAC
GGTACCTGCG AGGTCAAGGA CAACGCCCTG GTCGGCACCG TGCGCCTCGG GTCGGGCGAG
GATACGTTCG AGGTTCGCAT CCGACCCAAG GTCACCATCC GCCGTCTGCT GTTTCTGCTT
GGCTACGCGC AGGATCGCGG CAGGTGGTTC GAGGACGAGG TCCAGGCGGC GGAGGAACCG
GATCTTCTGC CCGCGGTCGC CGCGGCGTTC GCCCGAACCG CCTCGCGGGC GCTCGCGCAC
GGGGTGCCGC GAGGCTATCG GCAGGTGGAT GCGGCGCTTC CCGTCCTTCG CGGCCGGCTG
CGCGAGTCCG CGCAGCTCCG GCAACGGTCC GGGGTGATGT TCCCCCTCGA GGTGCGCTAT
GACGAGCGCA CCGTCGACAC CGCCGAGAAC CGGTTGCTGC TCGCCGCCAC CCGCTCGCTG
CTCGCCCTGG CCGGGGTGGC ACCGGCCACC GCCCAGGAGC TGCGCCGTAT CGCCGCCGCC
CTGGACGGTG TGGCCGAGCC GGCGCACGGC CCCGTCAAGC CGCCGGACTG GGTGCCAACC
CGGGTGAACG CGCCTTACCA TGCGGCGCTC CGGCTCGCCG AGACGGTCTT GCGCTCGTCT
TCCTTCGAAC GGGAAGACGG GGAGACGCTC CGGGTGGACG GCTTCGTGGT GAAGATGTGG
GAGGTCTTCG AGGACTTCGT GACCCACGCC GTCGACGAGG TCCTCACCCA CCGCGGCGGT
GAGGTCCGCC TGCAGGACCG CACCCACCAC CTCGACGAGG ACCGGACGCT GGAGATGTGC
CCCGATCTCG TGCTGTACCG GCCGGAGGGC CCGGGCGGGC GGATGATCCC GGCGGTTGTC
CTCGACGCGA AGTACCGGCT CGCGATCCGA CAGGGCGCGC GCGCACACGT GTACCACCAG
ATGATCGCCT ACTGTGCCCG GCTCGGCGCC CGGCAGGGAT GGCTCGTCTA CGCCGGCTCG
GAGCGGGCCG ACGGCCAGCC CGGCGGTCGT GGCGACGTCA TCCGGAGCCG GATCGGGGGT
CCCACGCCCA TCGGGCTCGT GACGTACGTG CTCGACCTGA GGCTCCCCCT GGCCGAGTTG
CGGGCCAGGA TCGAGCGGAT CGCCGACGAT ATGGTCACCC CGTCCGTCTG A
 
Protein sequence
MLAPVELTEG AGWQRRKLSP GQADALDASE VAQVRQRRAD GTCEVKDNAL VGTVRLGSGE 
DTFEVRIRPK VTIRRLLFLL GYAQDRGRWF EDEVQAAEEP DLLPAVAAAF ARTASRALAH
GVPRGYRQVD AALPVLRGRL RESAQLRQRS GVMFPLEVRY DERTVDTAEN RLLLAATRSL
LALAGVAPAT AQELRRIAAA LDGVAEPAHG PVKPPDWVPT RVNAPYHAAL RLAETVLRSS
SFEREDGETL RVDGFVVKMW EVFEDFVTHA VDEVLTHRGG EVRLQDRTHH LDEDRTLEMC
PDLVLYRPEG PGGRMIPAVV LDAKYRLAIR QGARAHVYHQ MIAYCARLGA RQGWLVYAGS
ERADGQPGGR GDVIRSRIGG PTPIGLVTYV LDLRLPLAEL RARIERIADD MVTPSV