Gene Francci3_2651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2651 
Symbol 
ID3906324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3131163 
End bp3132365 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content74% 
IMG OID637879976 
Producthypothetical protein 
Protein accessionYP_481742 
Protein GI86741342 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.589056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0568929 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGGCC GGCATGCGGC CACCCGCTGG GAGGACCGCT GGCAACTGCG CGAGGTCGTG 
CTCGCCTGGC TCGCGGCCCG GCTGGCGGTC GGTGCGGCCC TGGCGCTGAC CAGGTACATC
GCCGACAACG CCCGCGGGCA GCAGGGCGCG CAGCTGAGCA CCACCGACCT GCTCGGTTGG
GACGCCGGGT GGTACAAGAC GATTGCCGAG GTCGGGTACA GCGGCGCTGG GGCCGAGAGC
CGCCGGTTCT TTCCCCTGCT GCCGCTGCTC GTGCGGGGGT TGTCGAAGCT GCCGGGAGCC
GACGCCCACG TCGGCGTCGT GCTCCTGGTC GTGGTCAACG TGTGCGCGTT GGTCTTCGCG
CTGCTGCTGG TGGGGCTCGC CCGGTTCGAG GGGTTCGCCC CGGAGGCGAC CACCCGGCTC
ATCTGGCTCG CCGCGCTGGC TCCACCCGCC TTCGTCCTGG TCATGGGGTA CGCCGAGGCA
CTCGCCGGGC TGCTGGCCGT GGCGGTGTTC CTCGGTGCCC GCAGCGGGCG CTGGGAACTG
GCGGCCGTGG CCGGCCTGCT CGGCGGCCTG TGTCGCCCCC TCGGTCTGAT CCTCGCGGTG
CCGGTGCTTC TCGAGGCGGC CCGGGGGCTG CCTTGGCCGC TCGTCCGCCG GCTCGGCCCC
GGCCCCGGCC CTGGCACGCC GGCCACCACG CCTACGGCTG GAACGCTCAC CGCCCGCGAC
GGGCTGCGTC GACTACTCGC CGTGCTCGCC CCGGTGGCCG GCGCCGGCAT CTACCTGCTG
TGGTCGGCGC ACTCCTACGG TGACGGGCTG GCGCCGTTCA CGCTGCAGCG CGACGCGGCA
CGTCACGGCA GCAGCTCGAA CCCGCTGGCG GTGCTGTGGG ACGCGGCGCG GGGCGCCTTC
TCCGGCGAGC TGGGCACGGC GCTGCACGTG CCGTGGCTGC TCCTGGCGCT CGTGGGGCTG
GTGGTCATGG CGCGCCGGCT GCCAGTGTCG TACCCGGTGT GGAGCGCCCT GGTCCTGGCT
GCCGTGCTGA CCGGCAGCAA CCTGGACTCC GCCGAGCGGT ACGTCTACGG GGCGTTCCCG
TTCCTGCTGG TCGCCGCGCT CGTCACCGCC CGGCGGGAGA TCTTCACCTT CACGCTCGCC
GCCACCACGG CCGCGATGAC CCTCTACGCG ACGCTGGCCT TCACCCTCTC CTACGTGCCC
TGA
 
Protein sequence
MGGRHAATRW EDRWQLREVV LAWLAARLAV GAALALTRYI ADNARGQQGA QLSTTDLLGW 
DAGWYKTIAE VGYSGAGAES RRFFPLLPLL VRGLSKLPGA DAHVGVVLLV VVNVCALVFA
LLLVGLARFE GFAPEATTRL IWLAALAPPA FVLVMGYAEA LAGLLAVAVF LGARSGRWEL
AAVAGLLGGL CRPLGLILAV PVLLEAARGL PWPLVRRLGP GPGPGTPATT PTAGTLTARD
GLRRLLAVLA PVAGAGIYLL WSAHSYGDGL APFTLQRDAA RHGSSSNPLA VLWDAARGAF
SGELGTALHV PWLLLALVGL VVMARRLPVS YPVWSALVLA AVLTGSNLDS AERYVYGAFP
FLLVAALVTA RREIFTFTLA ATTAAMTLYA TLAFTLSYVP