Gene Francci3_1870 details


Gene Information

Locus tag: Francci3_1870
Symbol:
ID: 3906145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2204950
End bp: 2206461
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 69%
IMG OID: 637879208
Product: phage integrase-like SAM-like
Protein accession: YP_480975
Protein GI: 86740575
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
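
The coordinates and lengths in the table above agree with each other, which a short check makes explicit. This is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon.

```python
# Values copied from the Gene Information table.
START_BP = 2204950
END_BP = 2206461


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # 1512 503
```

Under those assumptions the result matches the listed Gene Length (1512 bp) and Protein Length (503 aa).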


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.254784
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGGTCGTGG TTCGGCGCCG TGTCGTGGTT CTACCGTCGA CTCCCATGTT GCTGATCTTC 
TTCTCGTCAC GGGGGTGGGA GTCGTGGGAT GTCGAGGCAG CGCCGCTGAT CCCCGAGCGC
ATGCCGGTGC TGGTCGACGA CGATCTGCGG TTCGAGGACG GGCCGGGGTG CGTGCGGCCG
GCGGCGGTGG TGAACCGGTG GCTTCGTGAG CTTCCGGCGT GCGGTGTGCC GGCGCCGCGG
TCGTGGGCGG CGTATGCGCG GGCGGTGAAG GACTGGGTGG AGTTCCTGGC CGGGCACGGC
ATCGACGTGC TCGGCCCGCG CGAGCAGCTC AAACTGGCAC TTGGGAAATA TGCCGAGGAC
CGGTCGGCGG GGCCGGTGGA GCGGCGGCTG GCGGCGAGCA CGTGGAGCCA GCACATAAGC
ATCGTGAGCA TGTTCTACCG GTGGGCGATG GCCGAAGGCT ATGCGTCGGC CGAGCCGTTC
ACCTACCGGA CCGCGAAGGC CATGGTCGGC GGCATGGTGC GTGACGTCCG GGTGAACCTG
GCGATGCGCC GAACTCCGAA ACCGCACGTC ACGATCAAAT ATCTGGAAGC CGACTTTGCG
ACGTTGCTGC TGAACGCGCT GGCCGGGCTG GACCGGGACG GTCAGGCGGA TGCTGGCTAT
GCGGGGCGGG AGCTTGCCCG TAACCGGGCG GTGGTCGGGT TGGCGTTGGC GACGGGCCTG
CGGTTGCAGG AGTTCTCGTC GCTGCTGGTC TATGAGATCC CGACGCCGCC GTCCCGGCCG
GGGGCGCTGC CGGTGCCGTC CGGGGTGGCC AAGGGCCGTA AGTTCCGTAC CAGTTGGATC
AGCGTCGAGG CGCTGAGGGT TGTGCATGAT TATGTGACGC TGGACCGGGC CGCCGCGGTG
GAAGGAGCAT CATGGTGTCC GCCGGCCCGG TGGGGTCCTG CGCTGCGGGT GAGCGAGCCG
GACATGCGTG GCGGGCGGAT TAACGGGGTG CGCCGGGACT GGGATGCGCT GACGCCGGGC
GAGCGGCGGC GGCTGGTCGC TCCGGATGGT GGTTCGTGCC TGTTGGCGGT CCGCTCGGAC
GGCGGACCGT TCACCGCCTG GGCGACCGTT TTGGAACGCG CGTCGGATCG TATCCGGGCA
CGTGTCGAGC TGCGGTTCCC GCATGTTCAT CCGCATCGGC TGCGGCATTC TTTCGCGATG
GCCACGTTGG AACGGCTGGT GGATGGGCAT TACCGGCGGG CCGCCGAACT GGTGGCGGCC
GGTGGAGACG GCGGGCCGGA CGCCGCGTTG GCGTTGTATC TGAGCAAAGC CGATCCGCTG
CTGGTGCTGC GGGATTTGAT GGGTCACTCA TCGGTCACCA CGACGGAGGC TTACCTTCGC
CGGTTGGACA CCACCCGGAT TTTCGGGGAG GCGTATGCGC GAGCCGGCGC GGCGGCCGGC
CTGGCCGATG ATCCGGCGGC GGACACCGAG GTCGCGGCGG AGTTCACCGA CGAACCTGGC
GAGGACGACT GA
 
Protein sequence
MVVVRRRVVV LPSTPMLLIF FSSRGWESWD VEAAPLIPER MPVLVDDDLR FEDGPGCVRP 
AAVVNRWLRE LPACGVPAPR SWAAYARAVK DWVEFLAGHG IDVLGPREQL KLALGKYAED
RSAGPVERRL AASTWSQHIS IVSMFYRWAM AEGYASAEPF TYRTAKAMVG GMVRDVRVNL
AMRRTPKPHV TIKYLEADFA TLLLNALAGL DRDGQADAGY AGRELARNRA VVGLALATGL
RLQEFSSLLV YEIPTPPSRP GALPVPSGVA KGRKFRTSWI SVEALRVVHD YVTLDRAAAV
EGASWCPPAR WGPALRVSEP DMRGGRINGV RRDWDALTPG ERRRLVAPDG GSCLLAVRSD
GGPFTAWATV LERASDRIRA RVELRFPHVH PHRLRHSFAM ATLERLVDGH YRRAAELVAA
GGDGGPDAAL ALYLSKADPL LVLRDLMGHS SVTTTEAYLR RLDTTRIFGE AYARAGAAAG
LADDPAADTE VAAEFTDEPG EDD
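
The relationship between the gene sequence and the protein sequence can be sketched with a small translator for translation table 11, the table listed in the gene information. This is an illustrative sketch only, not the tool that produced this record: `translate_cds` is a hypothetical helper, the codon assignments follow NCBI genetic code 11, and the first codon is read as methionine when it is one of that table's initiation codons (here the gene starts with TTG).

```python
# Codon table for NCBI translation table 11, built in the conventional
# TCAG ordering (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons recognised by table 11; as a first codon they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS (spaces and newlines allowed) up to its first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons still initiate with Met
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt of the gene sequence listed above.
    print(translate_cds("TTGGTCGTGG TTCGGCGCCG TGTCGTGGTT"))  # MVVVRRRVVV
```

The first ten residues produced this way match the start of the protein sequence, and the final 12 nt of the gene (GAGGACGACT GA) translate to EDD followed by the TGA stop codon, matching its end.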