Gene Francci3_0496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0496 
Symbol 
ID3903016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp580535 
End bp582106 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content72% 
IMG OID637877826 
Productintegrase 
Protein accessionYP_479610 
Protein GI86739210 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.772953 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATACGG CGCGGGGCGT GCTCGCCGAG TTCAGCCACG CGAGCACGCC GACTCAGGAC 
TTGGTTCTCG CGGCGATCGA GGCTCGGCTT GAGCAGGAGC ACGGCCCGGG GGTGGTGCGC
CTGCCGGGTC GGACCCGGGC GCGGGCCCTG CTGCGGGAGC TGAGCCGCGG CACCAGCGCG
TTCGGTGGAG CCAAGGGCCG GCGGGAGATC GCGGGCCGCC CGGTGGCGCC TTACGGGAAG
CTGCGGGCGC ACCGGCCGGG TGAGTACCTG CTGGTGGACA CGACCAGGTT GGACGTGTTC
GCGATGGAGC GGGTGACGCT GCGCTGGGTG CAGGCGGAGC TGACGGTCGC GATGGACCGC
TACGACCGGT GCATCACCGG GCTGCGGCTG ACCCCGGTGT CGACGAAGGC GGTCGACGCC
GCGGCGGTGC TGTTCGAGTC GATCCGTCCG TTGCCGGAGC CGGCGGCGGG CTGGGTGGAT
GTCCGCCCGC CCTATCACGG TGTTCCTGGG CGGGTGGTGG TCGATGTCGA GCGGCTCGTC
GACGCCGGCG GCGTCCCGCT GTTGCCGTCG GTGGCGGCGG AGACCCTGGT GGTCGATCAT
GGCCGGATCT ATCTGTCGGA GCATCTGCTG TCGGTCTGCC AGCGGCTGGG GATCTCGGTG
CAGCCGGCGC GGGTCGCCCA GGCCACCGAC AAGGCAGCGG TCGAACGCTT TTTCCGCACG
CTGCGCGAGC AGCTACTCGT CGCGCTTCCC GGCTACAAGG GCCCGGACGT CCACCACCGG
GGCGCCGATG TCGAGGAGCA GGCGTTCTAC TTCCTCGACG AGCTCGAAGA ACTCATCCGC
CAGTGGGTCG CGGACTGCTA CCACCGTCAG CCCCATGGCG GCCTCGTGGT CCCGGAGGTG
CCGGGGCTGG CGGTGTCGCC GTTGGAGATG TTCGCCCACG GGGTGGCGCG GGCCGGTCAT
CTCCAGGTGC CCGCGCGGGC GGACCTGGTC TTCGACTTCC TGGCGGTCGA ATGGCGCACG
ATCCAGCACT ACGGGGTGGA GATCGGCGGG CTGCGCTACG ACGGGCCCGC CCTGTCGCCC
TACCGCAACC GGACCAGCCC GCACACTGGC GTCCACGCGG GCAAGTGGCC GATCCGCGTC
GACGCCGATG ACGTCAGCCG GGTCTACTTC CAGGACCCGG CCGACCAACG CTGGCATGTG
CTGCGCTGGG AGCACGCCGA CGCCCTGGGC GGCCCGTTCA GCGCGGACGC GCTGGCCTAC
GCCCGCCAGC TCGCCACCGC GACCGACCGG TTCCCCGACA CCCGCCGAGC ACTGGCCCGG
CTGTTGGAAC GCTGGGACGC GGGCCTGGCC GGCAACCGGG CCGAGCGGCG CATGGCGGTG
CGTCTGTCCG AACGGCGGCT GCGTCTCGTC GGCGACACGG CCGTCCCGGA CGAACCCGCC
CCGGCGGTCG CCTCGCCCGA CCAGGACCGT TCGGCCGAGG AGACGGCGGG CGATGACGAC
CGCGACGACG AGCTTGGCGC CCCGTTCCCT GGCGAAGACG ACTTCTACGC CGACGCGATG
GAGATCGTGT GA
 
Protein sequence
MDTARGVLAE FSHASTPTQD LVLAAIEARL EQEHGPGVVR LPGRTRARAL LRELSRGTSA 
FGGAKGRREI AGRPVAPYGK LRAHRPGEYL LVDTTRLDVF AMERVTLRWV QAELTVAMDR
YDRCITGLRL TPVSTKAVDA AAVLFESIRP LPEPAAGWVD VRPPYHGVPG RVVVDVERLV
DAGGVPLLPS VAAETLVVDH GRIYLSEHLL SVCQRLGISV QPARVAQATD KAAVERFFRT
LREQLLVALP GYKGPDVHHR GADVEEQAFY FLDELEELIR QWVADCYHRQ PHGGLVVPEV
PGLAVSPLEM FAHGVARAGH LQVPARADLV FDFLAVEWRT IQHYGVEIGG LRYDGPALSP
YRNRTSPHTG VHAGKWPIRV DADDVSRVYF QDPADQRWHV LRWEHADALG GPFSADALAY
ARQLATATDR FPDTRRALAR LLERWDAGLA GNRAERRMAV RLSERRLRLV GDTAVPDEPA
PAVASPDQDR SAEETAGDDD RDDELGAPFP GEDDFYADAM EIV