Gene Francci3_0407 details


Gene Information

Locus tag: Francci3_0407
Symbol:
ID: 3903649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 477885
End bp: 479396
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 69%
IMG OID: 637877736
Product: phage integrase
Protein accession: YP_479523
Protein GI: 86739123
COG category: [L] Replication, recombination and repair
COG ID: [COG4973] Site-specific recombinase XerC
TIGRFAM ID:
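
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive coordinates (the record does not state its convention) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and exist only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length(477885, 479396)
print(length)                  # 1512
print(protein_length(length))  # 503
```

Both results match the Gene Length and Protein Length fields.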


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.995866
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCGAC GGCAACCCCC TGGACGTGGC CTGATGGCCG AGGGTAAGGG CAGCAGACGC 
GGCAACGGCG AAGGCTCGAT CTACCGGTAC CGCAACGGTT GGGCCGGTCA GATCAGTATG
CCGGACGGCA CCCGGCCGAC GTTCTACGGC AAGACGAAGG ACGACGTCCG AGGGAAGATC
ATTGCCGCGC TCCGGGCGGT GCAGGACGGG GTGCCGCTGG TCACGACACG GATGACGGTG
GCCGAATATC TCGATCAGTG GGTGACCATC GTTCTGCCTT CCCGCGTGGT CGCCGGAACG
CTCGCCGAGT CAACGGCCGA GTCGTATGCC GATCAGGTCC GGCTGCATAT CACGCCGATC
CTCGGCCGTC ACGAGCTGCG CAAGCTGTCC ACCGCGCACG TCCGCCACTG GATGACGCGC
AAGCTCACGG AACCGTCGTC CGCGGGTCTG GCGCAGTGCG CGGCCTGGGA GGCTGAGGCG
GCCCGCAAGG CGGCAGAACG CGCGAAAGAG AAGAAGACTC GCCCGGCTCG ACGCGCGACA
GCACAGAAGA AGGCTCCGGC ACCGAGCGAA CGCAAGCCCG TCAAGCCGCT GTCGGCCCGG
TCGGTGCGCT ACCTGCTGGT GATCCTCACC GCGGCGTTGA ATGACGCGGT CCGGGAGGAA
CTGCTGACCC GGAACGTCGC CGCTCTCGTC CAGCCGCCGA GGAAGGCCGA TGCGCCGATA
CGGGCGCTCA CCGAGGAAGA AGCCCGTCGG GTTGTCGCGG TCGCGCTCGT GGACCGGATG
TCCGCGCTGT GGCTGGTGCT GCTGGCGCTC GGGCTGCGCA AGGGGGAAGC GCTCGCGCTG
CGCTGGGATG CGCTCGACCT TGATGCGGGC ACGGTCGCCG TGGTGCGCAA GCAGCGGCGT
CGCAAGGCGG GAATTGATCC GGAGACCGGC CGGCAGCGCT GGGAACTCGT CGAAGAGGAA
GCCCTGAAGA CGAAGGGTTC AAAGGCGGTT CTCGCACTGC CCGACATGGT CGTCACGATG
CTCCGTGAGC ACCGGTGGCG GCAGGACGAC GCGAAAGCCG ACCTCAGCGC CAAAGCGGAC
CAGGCGGAAA AGGACCTGGC GGCGCTGGCC GCGCAGAACC TCCCCGAGGG GGTGTCCGGG
ACTCCGGCGG ACCTGAAGGT CATCCGGTGG GAAGACCCCG GACTGGTGTT CACTACCGCA
CTCGGGCGGA AGGTCGACCC GCGCAACGTC AACCGGTGGT GGGACGCGGT GTGTGAGCGG
GCTGAGGTGC GGCACACCCG CGTGCACGAT CTCCGCCACA CGGCGGCGAC GATGCTGTTC
CGGGCCGGCG TCGACCTGAA CGAGATTCGC GCGCTTCTGC GTCACACCCG CCTTGGCACC
ACCGCGGACA TCTACGTTGA CGTTCTGGCG GACGTCCGGC GGGGGACGGC CCGCAGCATG
GACGACATCC TCACCCGGCT CCGGAACCCG ACCACCGCTA CGGAGAGTGA GGCAGACGAG
GGTGCCGCCT GA
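
The GC content field can be recomputed from the gene sequence. The following is a minimal Python sketch, not the database's own method: `gc_fraction` is a hypothetical helper that ignores the spaces and line breaks used in the listing above. It is shown on the first 60 bases only; passing the full sequence should give a value close to the reported 69%.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above (60 bases).
first_line = "ATGGCCCGAC GGCAACCCCC TGGACGTGGC CTGATGGCCG AGGGTAAGGG CAGCAGACGC"
print(f"{gc_fraction(first_line):.2f}")  # 0.70
```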
 
Protein sequence
MARRQPPGRG LMAEGKGSRR GNGEGSIYRY RNGWAGQISM PDGTRPTFYG KTKDDVRGKI 
IAALRAVQDG VPLVTTRMTV AEYLDQWVTI VLPSRVVAGT LAESTAESYA DQVRLHITPI
LGRHELRKLS TAHVRHWMTR KLTEPSSAGL AQCAAWEAEA ARKAAERAKE KKTRPARRAT
AQKKAPAPSE RKPVKPLSAR SVRYLLVILT AALNDAVREE LLTRNVAALV QPPRKADAPI
RALTEEEARR VVAVALVDRM SALWLVLLAL GLRKGEALAL RWDALDLDAG TVAVVRKQRR
RKAGIDPETG RQRWELVEEE ALKTKGSKAV LALPDMVVTM LREHRWRQDD AKADLSAKAD
QAEKDLAALA AQNLPEGVSG TPADLKVIRW EDPGLVFTTA LGRKVDPRNV NRWWDAVCER
AEVRHTRVHD LRHTAATMLF RAGVDLNEIR ALLRHTRLGT TADIYVDVLA DVRRGTARSM
DDILTRLRNP TTATESEADE GAA
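
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged illustration, the Python sketch below builds a codon table from the standard amino acid assignments, which table 11 shares with the standard code (it differs only in the set of permitted start codons; this gene starts with ATG, so that difference does not matter here). `translate` is a hypothetical helper and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence above.
print(translate("ATGGCCCGAC GGCAACCCCC TGGACGTGGC"))  # MARRQPPGRG
print(translate("GGTGCCGCCT GA"))                     # GAA
```

The two outputs match the first ten and last three residues of the protein sequence listed above.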