Gene Francci3_1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1102 
Symbol 
ID3905773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1313124 
End bp1316426 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content60% 
IMG OID637878434 
Productputative DNA methyltransferase 
Protein accessionYP_480211 
Protein GI86739811 
COG category[R] General function prediction only 
COG ID[COG4889] Predicted helicase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.822518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGG CAACGGATCA TTGGCTGCTG AAGGCAGTAT CGGATTTTGG TCGGGCGTGT 
GGCACCAAAC TCTCCGGAGG TGGCAGCCCG GAAGCCTCGA TACGAAGCCC GCTGGAAGGG
CTCCTGGCGG CAGTTGGCAG ACATCACAAG CTGTCCGAAC TTACCTGGCA TGACGAAGTG
AGGCTTCCCG AACTCGGAGT GCGGCCCGAC TACGCAGTCC GGCTGAGTGG CCTGGTCACC
GGATACATAG AAATCAAGAA GCCCGGCCTG TCCGTAGACC CTCAGAATTT TACCGGGCAC
AACAAGGAGC AATGGGAACG GCTGCGAGAC CTTCCAAATC TACTTTACAC CAACGGCACC
GAATGGCGGC TTTTTCGGGA CGGGATCCAG ATAGGAGAGA CCATTCACTT CACCGGAACT
CTCCGGACAA GCAGGGAGCG GCTTCGGCCA CCGGATCCGG CCGCCTTCGA CGCACTGATC
ACGACGTTCC TCGTCTGGAG CCCGCCACCA ATCCAGAACG TTGGCAGGCT CGTGCAGAAC
ATCGCACCGT TGTGCCGCCT CCTCCGCGCT GCCGTCCTGG AGCAGCTCAC CGCCGAAGCG
AAGTCGGACG CACCGGAGAA CGACGGCCAC GCACGTCCCT TCACCGGCCT CCAGAACGAC
TGGCGGACGC TACTCTTCCC CGAGGCAGAC AGCCGGACCT TCGCCGACGG ATACGCCCAG
ACGGTAACCT TCGCGCTGCT GCTCGCACAA ACGGAGGGAA TTCCTCTCCT GCAGGTCGGA
TTTCATGATA TCGGCCGGAA GCTGAACGCC GGCCACGCAC TTATCGGCAG CGCGCTGCGA
CTGCTCACCG ACAATATAAA CGAGCGTTTT TCGGTGACGC TCGATCTACT GACCCGCACC
ATCTCCAGTG TCGATTGGCC TGCCATCCGG AACGGCAACC GGGATGCCTA CCTACATCTC
TACGAAAACT TCCTGACCAG GTATGACGCA CAGCTACGAC AGCAGAGCGG TTCGTACTAC
ACGCCGCGAG AGGTCGTGGA GCACATGGTC CGGTTGGCAG AGGATGTGCT GCGCACCCGG
CTCGGCAAGG ACCATGGTTA CGCCGATCCG GACGTTCGCA TCGTCGACCC GGCGATGGGA
ACCGGCACTT TCCTTCACGC GATCATCGAG AGAGTCGCAG AAACTGCCTC CGAAGGAGGG
GGCGAAGGCA TGGAAATCGA CGCGGTCGCG CAGCTCGCGG AACGACTGTA TGGCTTCGAG
CTGCAGATAG GTCCCTACGC GGTCGCCGAA CTCCGAACCT CAGACCTGTT AAGAGCGGAG
GAGATCCCAG CGCCGCGAGA GGGACTCAAT CTCTTCCTAA CAGACACGCT TGATTCGCCG
TTCAGTGATA CGCAGAAAGC ACTCTTCGGC TATCGGGAAC TGGCGGCTTC CCGACAGCGA
GCCGATCAGG TAAAGGGCAA TGTTCCCGTG ACTGTTGTCA TCGGCAACCC CCCCTATGAC
GACAAGGCGA AGAAACGCGG AAAATGGGCA GAGAAAAAAA TTCCAGGAGA GAATAGGACT
CCGCTCGACG CTTTCCGTCA CCCCGGCAAC GGCCGGTACG AACACGTACT GAAGAACATG
TACATCTACT TTTGGCGCTG GGCGACTTGG AAGGTCTTCG ATGCCCACGA GGCCGACCAA
CATGGATTGG TCTGTTTCAT CACGCCATCG GGCTTTTCCA CTGGCCCTGG CGGCCGTGGC
CTGCGCGACT ATCTCCGTCG CACCTGTCAC GAAGGATGGG TTATCAACCT CTCTCCCGAG
GGACAGCGCG CGGATGTTGC CACCCGCGTC TTTCCAGCAG TCGCGCAGCC ATTGGGTATC
TATATCTTCG TACGCCGCGC GGGGAGCTCT CCGGACGGCT CGACCCGGAT CCACTACCGC
TCGATCTCGG GCCGACGCGA GGAGAAGTTC AGACAGCTGG CCCATGTGGA GATTGACGAC
GGCGGCTGGA GGGATACGCA TCGCGAGCGA TCTCGTCCGT TCACGCCGAC TACGCAATCG
GCGTGGGAGG ACTTTCCAGA GATTTCCGAT ATATATCCGT GGGGAAGTCC TGGAGTAAAG
GCAAACCGCT CCTGGGTGGC TGCTCCCAGC GCGGAGATCC TGCGTCGCAG GTGGGCGCGT
TTGGTCCGCG AGGACGACCC GGACGCGAAA GAGGAGCTTT TCAAGGAGAC TCGGGACCGC
ACGCTGCTAC GTGCGGTGTC GCCGCTCCCC GGATACTCTG GACCTCGAGT GCCTATCGGG
CGGGAGAGTA GCCTGACCCC ACCTGTGATC CGGGTCGGGT TCCGCAGCTT CGATCGACAG
TGGCTGATTG CCGACCCCCG TGTGCTGGAT TTCGCCCGAC CAGACCTGTG GGCCTCACTT
CACGACGACC AGATCTTCCT CAACCAACAG TCCTCGCATG AGATCGAAAG CGGACCGGCG
GTTGTGGCGA CCGCATTACT TCCAGACACC CACCATTTTA ATGGGCGGGG TGGAAGAATT
CTGCCGCTAC TGCAACCGGA TGGCTCAGCC AACGTTCCAG GCGGACTGCT ATCCTATCTT
GCCAGCGCAT TCGGACTGGA GAAGGTTACC GTCTTGGATC TGGCTGCTTA TACAGTCGCA
GTCGCGGGGC ACTCGGCCTT CACCGAGCAG TTCGCGGAAG AACTGCTCAC GCCCGGCGTT
CGACTTCCTG TCACACGGAA CCTCGAGAGC TGGCGAAAGG CGTTGTCGAT CGGAGCCGAG
ATCCTGTGGG CTTCGACCTA CGGCGAAAGG TGTGCAGATC CGGATGCTGG TCGTCCCACC
CGCGACGTCA GGTTCAAGAG CGGGGATCCA CGTCAGGTCC GATACTTGAC CAATATCCGG
AAACAGATCC CAGAGAATTT TCACCACGAC CCAACAACAA ACACTCTACA TGTTGGCGCA
GGGTCTTTTG GGAACGTTCC AGAGGAGGTA TTGACCTTCA ACGTCGGCGC GATGCCAGTA
GTTAGGAAAT GGTTTGGATA TCGCAAGTTT TCCCCCAACA GCAAGAAGAC GAGCCCGCTC
GACGATATTC ACGTCGACAG CTGGCCCCGT GAGTGGGCCG CAGAACTCGT CGAGCTCCTG
TCGGCCCTAC GCCGCCTCGT GGATCTGGCA CCGGCGCAGC GCGATCTTTT GGCCGAGGTT
CTGGCGGGTC CCATGGTGAC CCAGCAGGAT CTCGCGGTTG CGGGAGTGCT CCCTGTGCGT
CCCGCCGCAC GGAAGCCGCG GTATGAAAAC CCGGTCGGGC TGTTCGTGGA CGGAGACGCC
TGA
 
Protein sequence
MSAATDHWLL KAVSDFGRAC GTKLSGGGSP EASIRSPLEG LLAAVGRHHK LSELTWHDEV 
RLPELGVRPD YAVRLSGLVT GYIEIKKPGL SVDPQNFTGH NKEQWERLRD LPNLLYTNGT
EWRLFRDGIQ IGETIHFTGT LRTSRERLRP PDPAAFDALI TTFLVWSPPP IQNVGRLVQN
IAPLCRLLRA AVLEQLTAEA KSDAPENDGH ARPFTGLQND WRTLLFPEAD SRTFADGYAQ
TVTFALLLAQ TEGIPLLQVG FHDIGRKLNA GHALIGSALR LLTDNINERF SVTLDLLTRT
ISSVDWPAIR NGNRDAYLHL YENFLTRYDA QLRQQSGSYY TPREVVEHMV RLAEDVLRTR
LGKDHGYADP DVRIVDPAMG TGTFLHAIIE RVAETASEGG GEGMEIDAVA QLAERLYGFE
LQIGPYAVAE LRTSDLLRAE EIPAPREGLN LFLTDTLDSP FSDTQKALFG YRELAASRQR
ADQVKGNVPV TVVIGNPPYD DKAKKRGKWA EKKIPGENRT PLDAFRHPGN GRYEHVLKNM
YIYFWRWATW KVFDAHEADQ HGLVCFITPS GFSTGPGGRG LRDYLRRTCH EGWVINLSPE
GQRADVATRV FPAVAQPLGI YIFVRRAGSS PDGSTRIHYR SISGRREEKF RQLAHVEIDD
GGWRDTHRER SRPFTPTTQS AWEDFPEISD IYPWGSPGVK ANRSWVAAPS AEILRRRWAR
LVREDDPDAK EELFKETRDR TLLRAVSPLP GYSGPRVPIG RESSLTPPVI RVGFRSFDRQ
WLIADPRVLD FARPDLWASL HDDQIFLNQQ SSHEIESGPA VVATALLPDT HHFNGRGGRI
LPLLQPDGSA NVPGGLLSYL ASAFGLEKVT VLDLAAYTVA VAGHSAFTEQ FAEELLTPGV
RLPVTRNLES WRKALSIGAE ILWASTYGER CADPDAGRPT RDVRFKSGDP RQVRYLTNIR
KQIPENFHHD PTTNTLHVGA GSFGNVPEEV LTFNVGAMPV VRKWFGYRKF SPNSKKTSPL
DDIHVDSWPR EWAAELVELL SALRRLVDLA PAQRDLLAEV LAGPMVTQQD LAVAGVLPVR
PAARKPRYEN PVGLFVDGDA