Gene Francci3_1203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1203 
Symbol 
ID3903555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1435345 
End bp1436529 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content67% 
IMG OID637878534 
Productphage integrase 
Protein accessionYP_480310 
Protein GI86739910 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00426039 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.328241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCC GTCGCCGCTT CGGCAGCATT CGCCGCCGCG AGTCCGGCCG ATACCAGGTC 
CGCTATCCCG GGCCGGACGG ACAGCAGCGC ACCGCACCCG AGACATTCGC GCGGAAGAGT
GAAGCGGAAC GGTATTTGAC CCTCATCGAG GGTCAAATAC TGCGCGGAGA ATGGATCGAT
CCGGAGCGCG GAAAGGTTAC CCTGACCGAC TACGCCGCCC GCTGGATTGT CGAGCGGCCC
AATCTCAGGC CACGCACGAT CGGCCTGTAT TCGGGGCTCC TCACCCGGCA TATCACGCCG
TACATCGGGG GTATTCCGAT CGGCAAGCTC ACGACTCCGA TCATCCGCGA GTGGCGGACA
AAGCTACTCG AATCGGGCGT GTCGGTGGGT ACGACGGCGA AGGCGTACCG GCTGCTGCGC
GCCGTGCTCA TGACTGCGGT CCGGGAAGAC GAGCTGATCA GGACCAACCC GTGCCGTATC
CCCGGGGCCG ATCAGGAGAA CGCTCCCGAG CGTCCGGTCC TGACCGTTTC TCAGGTGTTC
GCTCTCGCGG AGAAGCTTGG TGGTCGGTAC CGAGCGCTCG TGCTGGTGAC CACCTTTGCC
TCCCTGCGGT GGGGTGAAGT GGCCGCGCTT CAGCGACGCG ACCTCGACAC CGACGACGGC
ATCGTGCACA TCAGGCAGTC CCTGGTGGAG ATCGGTGGGC AAGGGGTCGT TCTCGGTCCG
CCCAAGTCCC GGGCGGGTGT CCGTACCGTG TCTCTCCCAG CGGTGATTCT GCCGTGGCTT
CGGATCCACC TTGCCGAATA TGTGGCGGAC GATCCGGCGG CATTCGTGTT CACAGGACCG
AAGGGTGGAT TTCTGCGCCG CGGGAACTTC CGGAAGCTCG TGGGATGGTC AGACGCGGTT
GCTGCCATCG GCATGCCGAA CCTGCACTTT CACGACCTGC GGCACACCGG CAACACGCTG
GCGTCTAGGA CCGGGGCGAG TCTGCGGGAC CTGATGACGC GCATGGGGCA CGACTCGCCC
CGCGCCGCGC TCATCTACCA GCACGCATCC ACCGAGGCTG ACGCGGCCAT CGCGGACGCC
CTCAGCGCCG TCCTGGCCGC GCAGCAGGGG AGCGTGCCAC CTCCCGCGCC ACGCCCGGAC
GACACCCCCG AGGACGGGGC CGCGGGGGCT CTCGTGCCGG CCTAG
 
Protein sequence
MAARRRFGSI RRRESGRYQV RYPGPDGQQR TAPETFARKS EAERYLTLIE GQILRGEWID 
PERGKVTLTD YAARWIVERP NLRPRTIGLY SGLLTRHITP YIGGIPIGKL TTPIIREWRT
KLLESGVSVG TTAKAYRLLR AVLMTAVRED ELIRTNPCRI PGADQENAPE RPVLTVSQVF
ALAEKLGGRY RALVLVTTFA SLRWGEVAAL QRRDLDTDDG IVHIRQSLVE IGGQGVVLGP
PKSRAGVRTV SLPAVILPWL RIHLAEYVAD DPAAFVFTGP KGGFLRRGNF RKLVGWSDAV
AAIGMPNLHF HDLRHTGNTL ASRTGASLRD LMTRMGHDSP RAALIYQHAS TEADAAIADA
LSAVLAAQQG SVPPPAPRPD DTPEDGAAGA LVPA