Gene Francci3_3028 details


Gene Information

Locus tag: Francci3_3028
Symbol:
ID: 3904381
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3593749
End bp: 3595581
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 73%
IMG OID: 637880348
Product: methyltransferases-like
Protein accession: YP_482114
Protein GI: 86741714
COG category: [R] General function prediction only
COG ID: [COG1568] Predicted methyltransferases
TIGRFAM ID:
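
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3593749, 3595581)
print(length_bp)                  # 1833
print(protein_length(length_bp))  # 610
```

Both values agree with the Gene Length and Protein Length fields of this record.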


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.274424
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCCCTC CACATACCCT GTCAGGACCG GATCACCCGG GCCGGGCGCC GGGTCCGGCC 
GAATCCGGCG AGTCGTTGGC GACCCTGCTG GCGGGCTACC GGGTGCATGC CCGGGCGCCT
CGCGCGGCGG TGGCGGCGCT GACCGAGCAG CCACGCACCC TGCGCCAGCT GGTCCAGAGT
TCGGGCCTGC CCCGACGGAG CGTGGAGGAG ATCCTCGCCA GCCTCGGCGA CGATCTGCGG
ACCGGCCCCG ACGGGCGTCA CGTGCTGCGC CCCGCGTCGA TCGACCGCTA CCGCGGCCTG
ATCCGCTATG ACGAGCTGAG CGGACAGACT CCGCTCGATC CCCTCGCCGC CGTGATCACC
CGCCACGGTC CGCTGGTCAC GACCATGCAT GATCTCATCG CTGCCGCGCC CCGACCGCGG
GCCGACCTCG ACCACGTACC CGCGACGGCG ACGACCGTCG TGCGCCGGGC CGTCTGGCTG
CGGACGCGTT ACGACCTGCG CGGCGCGCAT CTGCTGTGCA TAGGTGATCA TGATCTCACC
TCGCTGGCGG CGGTCTCGTT GATCGACGGG CTCACCGTGA CCGTCGTCGA CATCGACGAC
GAGCTGCTCG CCTACCTGGA CAGCTCGGCC CGGTCGTTGG ACGTCCGGCT GCGGTGCCTC
TACGCCGACC TGCGCTTCGG CCTGCCGCCG GCGGTGGTGG GCACAGCCGA TCTGGCATTC
ACCGATCCGC CCTACACCCC GGAGGGCGTG GCGCTGTTCA CCGGGCGCGG CGCGCAGGGT
CTCGCGGACC GCGAACATGG TCGCGTCCTG CTGGCCTACG GGTTCAGCGA CCGGGTACCG
ACGCTGGGTT GGAAGGTGCA GCGGGCACTG ATCGATCAGG GCTTCGTGTT CGAGGCCATC
TGGCCAGGCT TCCACGTCTA CGAGGGCGCG GAGGCGGTGG GCGCCAGAGC CGACATGTAC
GTCTGCCAGC CGACCCCGGC CACCTGGAAG CAGCTGGACC GCTCGGCCAC GGCGGCCACA
ACCGCGACGG CCGCCATCTA CACCCGGGGC CGGCAGTCCA CGCAGAGTCG GCCGACGCGC
CTGACCGCCC CGGTCCTCGA CGCGGTGGCC GCCTTCCTCG CCACCGGCCC GGCGGGTCGC
GCCGTGTTCG TTGGCGAGCG GCGGGAGGTC GACGCGGTCC ACGTGCGTCT CGCGACCGTG
TTCGACCGGG GGCTGCCCGC GTTCGCCTCG ACCGGCCCCG GCTCCTCGAC CGGCCCCGGC
GCCTCAACCG AGGACGGAAC GGTCAGCGTG GCCACCGACC TGTCGGACGA TCCGGGCCCC
TGGCTGACCA GGCTCCTCCT GGCGGTCAAC GCGGACCGGC TCGCCGTCGT CGTCTCCTCC
GATCATCCCG ATCTGGGCGT CCGCCGCCGG CGGGCCCAGG ATGACCCGCT GCGGCAGCTG
CGGGCGAAGT GGACCGCCAC CCCCGCCCGG GACCTCGGGG ACCTGCGGCT CGTGACGTTC
ACGGCCGTCG GCCCGGCGGC GCTCGCCCCC GCCGACCGGC TCGCCCGGTG GCTGCTGGAC
CGCCCGCATG GCAAGATCGG CAACGTCTGG CGGGACGGGC TCATCCGGAT CGTCCGGGAG
GATTCGGGGC GCACGCTCTC CCAGCGCGAC GCCCGGGCCG CCGTGACCCG GGCCGCGCGC
GATCCCGACC TGCTGGCCGC CCGACTGATC GATCTTCCCC GGCACGCCCT CGAGGGAATC
CTCGCCGCGG TGTCGTCCGG CGACACGCTG CCGGCCGAGC CGGTGAGACC GGGCTGGATC
CGGCAAAATG GGCAGACGCT GCGAAACGAG TAG
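
A GC content figure such as the one reported for this gene is the percentage of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from a sequence laid out in space-separated blocks like the one above; the helper name `gc_content` is an assumption for illustration, and how the database rounds its reported value is not stated in the record.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence above, used only as a small example.
print(gc_content("GTGTCCCCTC"))  # 70.0
```

Passing the full gene sequence, with its spaces and line breaks left in, works the same way because whitespace is stripped before counting.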
 
Protein sequence
MSPPHTLSGP DHPGRAPGPA ESGESLATLL AGYRVHARAP RAAVAALTEQ PRTLRQLVQS 
SGLPRRSVEE ILASLGDDLR TGPDGRHVLR PASIDRYRGL IRYDELSGQT PLDPLAAVIT
RHGPLVTTMH DLIAAAPRPR ADLDHVPATA TTVVRRAVWL RTRYDLRGAH LLCIGDHDLT
SLAAVSLIDG LTVTVVDIDD ELLAYLDSSA RSLDVRLRCL YADLRFGLPP AVVGTADLAF
TDPPYTPEGV ALFTGRGAQG LADREHGRVL LAYGFSDRVP TLGWKVQRAL IDQGFVFEAI
WPGFHVYEGA EAVGARADMY VCQPTPATWK QLDRSATAAT TATAAIYTRG RQSTQSRPTR
LTAPVLDAVA AFLATGPAGR AVFVGERREV DAVHVRLATV FDRGLPAFAS TGPGSSTGPG
ASTEDGTVSV ATDLSDDPGP WLTRLLLAVN ADRLAVVVSS DHPDLGVRRR RAQDDPLRQL
RAKWTATPAR DLGDLRLVTF TAVGPAALAP ADRLARWLLD RPHGKIGNVW RDGLIRIVRE
DSGRTLSQRD ARAAVTRAAR DPDLLAARLI DLPRHALEGI LAAVSSGDTL PAEPVRPGWI
RQNGQTLRNE
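
The protein sequence follows from the gene sequence through translation table 11, which uses the standard codon assignments but allows alternative initiation codons such as GTG to be read as methionine at the first position. The following is a minimal sketch of that translation, not the database's own pipeline; the names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are illustrative.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons accepted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTCCCCTC CACATACCCT GTCAGGACCG"))  # MSPPHTLSGP
```

The output matches the first ten residues of the protein sequence, including the leading M encoded by the GTG start codon.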