Gene Francci3_1144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1144 
Symbol 
ID3906623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1357810 
End bp1359078 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content69% 
IMG OID637878475 
Productphage integrase 
Protein accessionYP_480252 
Protein GI86739852 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.132159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGTA CCACCCCCGG CTCGTCGAAC GGCCGGCGCA AGGAGCGCAG GAAGCGCGCC 
AACGGCCAGG GCTCCATCTA CCAGCGCGGA GACGGCCTGT GGGTCGGCGC GGCCTACGTC
CTGATGCCTG ACGGGACGAC CAAGCGGCGG CCGGTCTACG GCAAGTCGGA AGAGATCGTC
CGCGGCAAGC TCACGGAGCT TCAGGCCAAC TCGGATCAGG GCATTCCGGC GGACGCCACC
GGATGGACCG TCGAACGCTT CCTGACGGTC TGGCTCGCGC AGACGGTGAA GCCGAACCGT
CAGCCGAACA CCTATGTCAC CTATGAGAAG GCGGTTCGGC TGTACCTCAT CCCGGGGCTC
GGGAAGAAGC GGCTGAACAG GCTGAGCGGT GCGGACGTCC GGCAGTTCAT CCGGCGGACG
GAAACCACCT GCCGGTGCTG CGAAATGGGC AGGGACAAGG CGCGGCCGGA ACAGGAACGG
CGCTGCTGCG CGGTCGGCCA GTGCTGCAAC CGCCGTCCAT CGAAGCGTCA GGTTCAGGTC
GTGCATGCCG TGCTCCGCAA CGCGCTCCAG GCGGCGGTGC GCGAGGAACT GATCCGACGG
AACGTGGCGA AGCTCGTCCA GGTCTCGACC CCTCGTTACA ACGTCGACCG GGGTCTGACC
GTCGAGCAGG CACACCGGCT TCTCGATGAA GCTGCCGGCA ATCGGCTGTA TGCGCTGCTG
GTCCTGGCGC TGTTCCTCGG GATGCGCCGC GGGGAGCTGC TCGGGCTCCA GTGGTCGGAC
ATCGACACCG ACCGGGAGAC GCTGACGATT CGGCACACGC TCCAGCGGGT CGGCGGGGAG
CTGCGGCTGC TGCCGCCGAA GACGGAGGAC TCGGAGCGCA CGCTGTCGCT CCTGGGCCTG
GTGGCCGACG CACTCACCGA GCATCGCAAG CTCCAGCAGG CGGAACGGGA CGCGGCCGGC
GACCGCTGGG TGACGACGGA CCACGTCTTC ACCACGAAGA TCGGAACGGC GATCGAACCG
GACAACCTGC GCCGCTTCTG GATGCCGCTG CGTCAGGCGG CCGGGCTCGA CGGGGTCGTG
TTCCACGGGC TGCGGCACAC CTGCGTGACG CTGCTCCTGG ACCTCGGCGT GCCGCCGCAC
ATCGTCCGTG ACATCGCGGG CCACTCGGCG ATCGAGGTGA CGATGACGAT CTACGCGCAC
GCCTCGATGG GGGAGAAGCG GCGGGCTCTG GGCCGGCTCG ACGGCCACCT GACCGGGCCC
CGACCCTGA
 
Protein sequence
MPSTTPGSSN GRRKERRKRA NGQGSIYQRG DGLWVGAAYV LMPDGTTKRR PVYGKSEEIV 
RGKLTELQAN SDQGIPADAT GWTVERFLTV WLAQTVKPNR QPNTYVTYEK AVRLYLIPGL
GKKRLNRLSG ADVRQFIRRT ETTCRCCEMG RDKARPEQER RCCAVGQCCN RRPSKRQVQV
VHAVLRNALQ AAVREELIRR NVAKLVQVST PRYNVDRGLT VEQAHRLLDE AAGNRLYALL
VLALFLGMRR GELLGLQWSD IDTDRETLTI RHTLQRVGGE LRLLPPKTED SERTLSLLGL
VADALTEHRK LQQAERDAAG DRWVTTDHVF TTKIGTAIEP DNLRRFWMPL RQAAGLDGVV
FHGLRHTCVT LLLDLGVPPH IVRDIAGHSA IEVTMTIYAH ASMGEKRRAL GRLDGHLTGP
RP