Gene Francci3_3390 details


Gene Information

Locus tag: Francci3_3390
Symbol:
ID: 3905972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4018557
End bp: 4019699
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 68%
IMG OID: 637880712
Product: phage integrase
Protein accession: YP_482473
Protein GI: 86742073
COG category: [L] Replication, recombination and repair
COG ID: [COG4973] Site-specific recombinase XerC
TIGRFAM ID:
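
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature whose coordinates are assumed 1-based and inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length_bp(4018557, 4019699)
print(length, protein_length_aa(length))  # 1143 380
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.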


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.457937
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCGTC CCAGTCTAGA TCTCGGTGTC GGCGGAAAGA TCTTCTACAG CGCCACGGCG 
AAGGGCTCCC GGGCACGCTG CTTCTACCGC GACCATGACG GGGTACGCCG CGAAGTCGAA
CGCGGCGGCA CCTCCAAGGC GGCGGCCACA CGGGCACTGA AACTCGCGCT GCGTGACCGG
CTGCGGGTCG CGGTCGGCGA CGGCGACATC ACACCGGAGA CGACTATGAA GGTCCTCGGT
GAGGCGTGGT TTGCCGAACA GCAGAAGAAA GACCGCTCTC CCAACACCCT CGCGGCCTAC
CGCACCACCC TCGACCGGCA CGTCTACCCC GCACTCGGCG GAGTGAAGGC CCGACAGGTC
ACGGTCGGAA CCGCGGACCG GTTTTTCAGC GCGGTCACGA CCAAAAGCGG CCCCGGTGCG
GCGCGGATCG CGCGAACAGT GCTGTCCGGC ATGTGCGCGA TGGCCGCCCG GCTCGACGCA
ATGGACCGCA ACGTGGTCCG CGACGCCGGG CAGATCACCC GACCGGAACC GAAGCCGGTA
TCCAAGGCGC TCGGCGCAGC CCAACTGCGG CAGTTGCGGG CGCTGCTCAC CTACGACGAA
CGGGCGCGGC GCCGCGACAT CCCCGATCTT GTGGACATGC TCATCGCCAC CGGCGCGCGC
ATCGGGGAGG TATGCGGGAT CGTCTGGGAC GCGGTCGACC TGGACGCGGG CACCGTGGAA
ATCCGGTCCA CCGTGGTGCG GATTACCGGC CAGGGTCTGA TCAACAAGCC TCGTCCGAAG
TCGAAGGCGG GCCACCGGCT GTTGCTACTC CCGGCCTGGG CCGTGGCCAT GCTGCGCACC
CGCCACCACG GACAGAACAG TGACGAGGTG GTGTTCCCCG CGCAGATGGG CGGCCTACGC
GACCCGAGTA ACACTCAGGC CGACATCCGC GACGCCGTGA ACGATGCCGG CTTCCCCGGC
CTGACGTCCC ACCTGTTCGG CCGCAGGTCC GTTGCCACCC TCCTCGATGG GGATGGGCAT
ACCCCCCGCC AGATCGCCGA TGTCCTCGGC CACGCCAACC CGTCCATCAC CCTGTCCACC
TACATGGGCC GGAAGGTCTC GAACCCCGGC GCGGCGGAAA CCCTCGCGGT CCTGGCCATA
TGA
 
Protein sequence
MARPSLDLGV GGKIFYSATA KGSRARCFYR DHDGVRREVE RGGTSKAAAT RALKLALRDR 
LRVAVGDGDI TPETTMKVLG EAWFAEQQKK DRSPNTLAAY RTTLDRHVYP ALGGVKARQV
TVGTADRFFS AVTTKSGPGA ARIARTVLSG MCAMAARLDA MDRNVVRDAG QITRPEPKPV
SKALGAAQLR QLRALLTYDE RARRRDIPDL VDMLIATGAR IGEVCGIVWD AVDLDAGTVE
IRSTVVRITG QGLINKPRPK SKAGHRLLLL PAWAVAMLRT RHHGQNSDEV VFPAQMGGLR
DPSNTQADIR DAVNDAGFPG LTSHLFGRRS VATLLDGDGH TPRQIADVLG HANPSITLST
YMGRKVSNPG AAETLAVLAI
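
The relationship between the gene sequence and the protein sequence above can be illustrated with a short script. This is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it assumes the sequence contains only the unambiguous bases A, C, G and T. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, the sketch does not treat alternative starts specially.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (an assumption of this sketch).
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGCTCGTC CCAGTCTAGA TCTCGGTGTC"
print(translate(first_blocks))  # MARPSLDLGV
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the listed protein sequence and the GC content field.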