Gene Francci3_0144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0144 
Symbol 
ID3903421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp171894 
End bp173606 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content74% 
IMG OID637877478 
Producttransposase IS66 
Protein accessionYP_479267 
Protein GI86738867 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.996772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGTGGT GCGTGGCGGT TGTCGAGTCG GGTGTTTGCG GCGTTGCTGG TGGTGAGGCG 
CTGGCGGATG TGTTGGTGGA GAACGGCCGT CTGCGAGTCG AGCTCGCGGA GCTGGGGACG
GAGCATGCCC GGCTGCTCGC GCGGGAGACC GCGGCCGCGG CCGAGCTTGA GGCGTTGCGG
GCGGAACTGG CGACGTTGCG TCGGATGCTG TTCGGCCGGT CGTCGGAGCG CGCGTCGGGC
GGGCCGCCGG CCTGCGCCGG CGCAGGGGAC AGCGGTGATG GTCGGGACGG GGCGGCTGGT
GAGGGGGCTG GTGGGTCGAC GGGCCGGCCG CGGGGGCCGG GGGCGCGTTC GGGCCGGCGC
AGCTACGACC ACCTGGCGCG TGACGAGGTC GACGGCGACT TCGAGGGCGG GGGTTACACG
TGCCCGTCGT GCGGGGCGTC GTTCACGCCG TGGGGTGAGC ACGTCGTCGA GCAGCTCGAC
TGGGTGGTGT CGGTGCGGCT GCGGGTCACG CGGCGTCGCC GCTACCGGCG AGGCTGCCGG
TGCGGGGGGC CGGTGACGGT GACCGCGCCG GGCCCGTCGA AGGCGATCGG GAAGGGTCTG
TTCACGCACC GGTTCCTCGC CCTGCTGGTC GTGGAGCGCT ATGTCGCGGG CCGTTCCCAG
AACTCACTGG TCACCGGCCT GTCCCGGCAG GGGGCGGAGA TCTCGCCGGC GACGCTGACC
GGGGCGTGCG CCCAGGTCGG CGACCTACTC GCCCCGCTCG CGGAGAAGAT CGCCGGGCGG
TCGCGGGGGG CGTGGCACCT GCACGCCGAC GAGACGACCT GGCGGGTGTT CACCCCGACC
GGCGGGGACG GCTCGGCGCG CTGGTGGCTG TGGGTGTTCC TGGGGCCGGA CTCGGTGTGT
TTCGTGATGG ACCCGACCCG CTCGGCCGCG GTGCTCGCCG ACCATGTCGG GATCGATGCC
GAGACCGGGC AGCTGACGGG CGACGACGGC GCCGGCGGCC CGCGCCGGCT GGTGATCTCC
TCGGACTTCT ACGCCGTCTA CGCGTCCGCG GGGGCGAAGG CGGACGGGAT CGTCAACCTG
TTCTGCTGGG TCCACGTCCG GCGGTACTTC CTGCGCGCGG GCGACGCGAA CCCCGCGCAG
CTGGGGATCT GGGCCCGGCA GTGGCGCGAG CGGATCGCCG CGCTCTACGC GGCGCACGCC
GAACTCGCCG CCGCCTGGCA GGCGGCGATC ACGGCCCCGA GCCCGCCGGC CGAGCGGCGT
CTCGCGACCG CCTACACCGG TTGGGACCGC GCGATCGACG CGATCGACAC CGCCCGCCGC
GAGCACATGG CGTCCCCCGG CCTGCCCGAA CCCGCGCGCA GGGCGCTGGC CACGATGGAC
CGGGAGTGGA CCGGGCTGGT CGCCCACCGC AACCATCCCA TGATCGGTCT GGACAACAAC
CCGGCCGAGA GGATCATCCG CAAACCGGTG ATCACACGGC GGAACACCGG CGGCTCCCGC
ACCGACGACG CCGCCCGCCG CGCCGCCACG ATCTTCACCG TCACCGCGAC CGCCGACCTG
CACGGCCTCA ACCCACTGAC CTACCTGGCC GACTACCTCG ACGCCTGCGG TCGCGCCGGC
GCCAGAGCAC CCACCGGGAC GGACCTCGAC CGGTTCCTGC CCTGGGCTGC CAACCCCGAC
GACCTCGCCC GCTGGAGGAA GCCTCCCGGC TGA
 
Protein sequence
MLWCVAVVES GVCGVAGGEA LADVLVENGR LRVELAELGT EHARLLARET AAAAELEALR 
AELATLRRML FGRSSERASG GPPACAGAGD SGDGRDGAAG EGAGGSTGRP RGPGARSGRR
SYDHLARDEV DGDFEGGGYT CPSCGASFTP WGEHVVEQLD WVVSVRLRVT RRRRYRRGCR
CGGPVTVTAP GPSKAIGKGL FTHRFLALLV VERYVAGRSQ NSLVTGLSRQ GAEISPATLT
GACAQVGDLL APLAEKIAGR SRGAWHLHAD ETTWRVFTPT GGDGSARWWL WVFLGPDSVC
FVMDPTRSAA VLADHVGIDA ETGQLTGDDG AGGPRRLVIS SDFYAVYASA GAKADGIVNL
FCWVHVRRYF LRAGDANPAQ LGIWARQWRE RIAALYAAHA ELAAAWQAAI TAPSPPAERR
LATAYTGWDR AIDAIDTARR EHMASPGLPE PARRALATMD REWTGLVAHR NHPMIGLDNN
PAERIIRKPV ITRRNTGGSR TDDAARRAAT IFTVTATADL HGLNPLTYLA DYLDACGRAG
ARAPTGTDLD RFLPWAANPD DLARWRKPPG