Gene Francci3_1763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1763 
Symbol 
ID3906829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2095985 
End bp2097292 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content72% 
IMG OID637879101 
Productputative transposase 
Protein accessionYP_480868 
Protein GI86740468 
COG category[L] Replication, recombination and repair 
COG ID[COG5659] FOG: Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.24112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTGTGG GGTGGGGATC CGGGTCTGGT TGGCTGCTTG CTGTGACGCC GGAGGAGATG 
GCGGTGGTGC GTCCGGTCGT GGAGGAGTTC GCGGCGCGGA TGTTCGTGGA TCTGCCGCGT
AGGGATCAGC GGGCCAAGGG TGAGCTGTAT CTGCGGGGGC TGCTGTTGGA CGGGAAGCGC
AAGAGCGTGC AGCCGATGGC GGAGCGCCTC GGGGTGGATC ATCAGCAGTT GCAGCAGTTG
CAGCAGTTCC TGCGGAACTC GACGTGGGAC TACGCCGAGG TTCGCGCCCG GCTGGCCGGC
TGGGCGGTCG GGTTCGTCCG CCCGGAGGCG CTGGTCATCG ATGACACGGG CTTCGTGAAG
GACGGGGCGG CGTCGCCTGG GGTGGCCCGG ATGTACTCGG GGACGTTGGG CAAGGTCGGG
AACTGTCAGA TCGGGGTGTC CGTGCACGCG GCGACGGACT GGGCGTCGGC GGCGCTGGAC
TGGCGGCTGT TCCTGCCTGC CAGCTGGGAC GACACCGGCC TCGACGACCC GGACGAGGCC
GCGGTGGTGC GTGCCCGCCG CGAGAAGGCA GGTGTTCCCG ACGGTGCGCG GCATCGGGAG
AAGTGGCGTC TGGCGCTGGA CATGCTCGAC GAGATCGCCG GCTGGGGTGT GCCCGCCCGG
CCGGTACTCG CCGACGCCGG CTACGGCGAC TGCGCGCAGT TCCGCCAGGG CCTGACCGAC
CGCGGCCTGG TCTACACGGT CGGGGTCACC CCGACCGCCA CCGCCCACCC GGGGGCCGCC
GAACCCGTGA CCGCGCCGTA CACGGGCAGC GGTCGCCCGC CGCTGCCTGC CTACCCGGAC
CCGCCGGCGA CCTTGAAAGC CCTGGTCCTG GCCGCCGGCC GCGGTGCGCT GCGGTGGGTG
ACCTGGCGGC GCGGCACCCA CAAGACCCCC GGCAACACGG CCGCGCTGAT GCGCTCCAGG
TTCCTGGCCC TGCGGGTCCG CCCCGCCGCC AAAGCCCTGA CCCGAGACGA TGACCGCAGC
CTGCCCGCCT GCTGGCTGCT CGCCGAATGG CCCCCTGGCA AGAACGAACC CACCGACTAC
TGGCTGTCCA CCCTGCCCCC GGACATCCCC CTGCGCCGAC TCGTCCGGCT CGCGAAGCTG
CGCTGGCGCG TCGAGCACGA CTACCGCGAA CTCAAAGACG GCCTCGGCCT GGACCACTAC
GAAGGCCGAT CCTTCGACGG CTGGCACCGC CACGTCACCC TCACCTGCCT CGCCCAAGCC
CTCTGCACCC AGCTGCGCCG CACCCCAAAA GCCCCTGCGC CGGCCTGA
 
Protein sequence
MVVGWGSGSG WLLAVTPEEM AVVRPVVEEF AARMFVDLPR RDQRAKGELY LRGLLLDGKR 
KSVQPMAERL GVDHQQLQQL QQFLRNSTWD YAEVRARLAG WAVGFVRPEA LVIDDTGFVK
DGAASPGVAR MYSGTLGKVG NCQIGVSVHA ATDWASAALD WRLFLPASWD DTGLDDPDEA
AVVRARREKA GVPDGARHRE KWRLALDMLD EIAGWGVPAR PVLADAGYGD CAQFRQGLTD
RGLVYTVGVT PTATAHPGAA EPVTAPYTGS GRPPLPAYPD PPATLKALVL AAGRGALRWV
TWRRGTHKTP GNTAALMRSR FLALRVRPAA KALTRDDDRS LPACWLLAEW PPGKNEPTDY
WLSTLPPDIP LRRLVRLAKL RWRVEHDYRE LKDGLGLDHY EGRSFDGWHR HVTLTCLAQA
LCTQLRRTPK APAPA