Gene Francci3_3938 details

Gene Information

Locus tag: Francci3_3938
Symbol:
ID: 3906897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4715392
End bp: 4716726
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 70%
IMG OID: 637881265
Product: methyltransferase type 12
Protein accession: YP_483017
Protein GI: 86742617
COG category:
COG ID:
TIGRFAM ID:
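
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon (the listed gene sequence ends in TAG). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4715392
END_BP = 4716726
GENE_LENGTH_BP = 1335
PROTEIN_LENGTH_AA = 444


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the three fields agree: the span covers 1335 positions, which is 445 codons, one of which is the stop codon.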


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.16065
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.145676
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCCG CAACCCCGGC CCAGCGCGGT AGTGGCACCG CCGAGGCCGA CGCGACCCAG 
AAGCAGGCGA CCCAGAAGCA GGATGCTCCC GGCGGCATCG CCACGGCGGT AATCGCCTGC
CGGTCCTGCG GCGGACCGGC CCCGCGCCTG TTCCTCTCCC TGGGGTCCAC CCCCATCGCC
AACCGCCTGG TGCGCGCCGA CGCCCTCGAC GCGACCGATC CGTCGTTCCC GCTTGAGGTC
GGCTTCTGCG AGGCCTGCGC ACTCGTCCAG CTCACCCACG AGCTGCCGGC GTCCGAGATC
TTCGACGAGG ACTATCCCTA CTTCTCCTCG TTCTCCGACA TGCTCGTCCG CCACGCGGAG
AAGCACGTGA TCGACCTGAT CGCGAGCCGC AACCTCGGGC CGGACAGCCT GGTGGTCGAG
GTCGCCAGCA ACGACGGCTA CCTGCTGAAG GCGTTCGTCG AGCGGGGCAT CCCGGTCCTC
GGGATCGAGC CGACCCCGGG CCCGGCCGCG GCCGCCCGGG AGGCGGGCGT GCCGACCCGC
GAGGAGTTCT TCGGCGCGGA GCTCGCCCGT CAGCTCGTCG CGGAGGGTCG CAAGGCCGAT
GTGATCATAG CGAACAACGT GATGGCCCAC GTCCCGGACC TCAACAGCTT CGTCGAGGGC
TTCTCGATCC TGCTCGCCGA CGGCGGCCTC GTCGACGTCG AGAACCCCGG GGTCGGCGCG
TTGCTGGCCC ACACCGAGTT CGACACGGTC TACCACGAGC ACTTCTGCTA CTTCTCCACG
ATCGCGGTCG ACGCCCTGAT GCGCCGGCAC GGCCTCGCGC TCGTCGGCGT CCAGGAGTTC
CCCGAGCTGC ACGGCGGCAC CCTGCGGTGG AGCATGCAGC ACACCGCCAC CGCGGACCCG
GCCGAGTCGG TGGCGGCGGT GCTCGACGCC GAGCGGGCCG CCGGGCTCGA CACGTTCGAC
CGGTACGCCA GCTTCGGCGA CGACGTCCGC GCCGTGCAGG ACGAGCTGGT GGCGTTGCTG
CGCTCGCTGC GCGCCGACGG TAGGACCATC GCCGCCTACG GCGCGGCGGC CAAGGGAGCG
ACCCTGCTGA ACTCCAGCGG CATCGGTACC GACCTGCTCG ATTTCGTCGT CGACCGCAAC
ATCCACAAGC AGGGCCGGTA CCTGCCCGGC GCCCGGTTGC CGATCCTCGA TCCTGCCGTC
CTGCTGGAGC GGCAGCCCGA CTACCTGCTG CTGCTGGCGT GGAACGTGAA GAAGGAGATC
ATCGCCCAGC AGGCCGAGTA CGCCGCGCGC GGTGGCTCCT TCATCGTGCC GGTTCCCCGG
CCCGTAGTGC TGTAG
 
Protein sequence
MSSATPAQRG SGTAEADATQ KQATQKQDAP GGIATAVIAC RSCGGPAPRL FLSLGSTPIA 
NRLVRADALD ATDPSFPLEV GFCEACALVQ LTHELPASEI FDEDYPYFSS FSDMLVRHAE
KHVIDLIASR NLGPDSLVVE VASNDGYLLK AFVERGIPVL GIEPTPGPAA AAREAGVPTR
EEFFGAELAR QLVAEGRKAD VIIANNVMAH VPDLNSFVEG FSILLADGGL VDVENPGVGA
LLAHTEFDTV YHEHFCYFST IAVDALMRRH GLALVGVQEF PELHGGTLRW SMQHTATADP
AESVAAVLDA ERAAGLDTFD RYASFGDDVR AVQDELVALL RSLRADGRTI AAYGAAAKGA
TLLNSSGIGT DLLDFVVDRN IHKQGRYLPG ARLPILDPAV LLERQPDYLL LLAWNVKKEI
IAQQAEYAAR GGSFIVPVPR PVVL
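
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch, not the database's own method, of how the listing could be cleaned and how a GC fraction (reported as 70% in this record) could be recomputed. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Paste the full "Gene sequence" block between the triple quotes to check the record.
    example = """
    ATGTCATCCG CAACCCCGGC
    """
    cleaned = clean_sequence(example)
    print(len(cleaned), round(gc_fraction(cleaned) * 100))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_fraction(...)` with the GC content field.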