Gene Francci3_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1950 
Symbol 
ID3904312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2291191 
End bp2292396 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content70% 
IMG OID637879287 
Producttransposase, IS4 
Protein accessionYP_481054 
Protein GI86740654 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.143043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGTAG TGTCCCCCAT GTTGTGGCTT GAGTGTCAGC AGGAGCAGGT TGAGGTCTCG 
ACAGCCGCGC AGGTCGCGGG GCTGGTCGCG GTGTTGCGGC GGGTACCGGA CTGGCGCGAC
CCGCGGGGTG TTCGCTACGA GCTCGCGCCG GTCCTGGCGT TGTGGGTCGC GGGCAACATC
GCAGGTCACG ACACCACCGT GGCGGTGTGG GAGTGGGCCT GCGCCCTCCC GGTCGGGGTG
CTGGCCGGCC TCGGGTTCCC TCGTCGGGTG CCGTCGGAGC GCACGATCCG GCGGATCGTG
GAGGAAGGCC CGCCCGGGCA GTTGGACCAG GCGCTGTCGG GCTGGACCGC CGCGGTCGCG
CCCGGGCCCG CGGACCCGGG CGGCCTGGTA GCGGTGGCCT TTGACGGGAA GGTGATGAAG
GGCGCCCGTT CCCGCCCGCC ACAGGGGTCG GTGCGCCAGG AGGCGGTCGT CGAGGCCGTC
CGCCACGACA CCGGGACCGC GCTCGGGCAT CAACGGGTCG TGGCCGGCGA CGAGATCGCC
TCGGTACGGA GGCTGGTCAA CCGGGTGTGT GACCACAACA CGCTGGTCAC GACCGACTGC
CTGCACGCAC ACGAACCCTT GGCCCGGGCG ATCCGGGCGA AGGGCGGCCA CTGGCTTTTC
TCGATCAAGG GCAACCAGCC CACCGTGCGG GCGAAGCTGG CCGGGCTGCC GTGGGACGAG
TTCGGAAACC AGCACGTGAC CCGGGAGAAG GCCCACGGCC GGATCGAGGA ACGCGCGCTG
AAGGCGTTGA CCCCCTCCGC GCCGTCCCTG GTCGGGTTCC GAGGTACCCG GCAGGTGGTC
AAGCTGGCCC GGACCACCCG CCGTAAGAAG ACCACGACCA GCCCGGCTGC GACCTCGACC
GAGGAGTTCT ACCTGGTCAC GAGCCTGTCC ACCGACCAGG CCAGCCCCGC GCAGCTGGCC
CGGTGGGCTC GTGGCCACTG GACGGTCGAG GCCATTCACC ATGTCCGCGA CCGGACCATG
GACGAGGACC GCCACACCAT CCGCACCAAG AACGCCGCCC TGAACTGGGC CATCGCCCGC
GATACCACGA TCAGCGCACT ACGCCTGGCA GGCTACAAGA ACATCAGACA GGCCCGCCGA
GCCACCATCA GAGACCCTGG GCTCGTCCTC CAGATCATCG CCCTGACCAG CCAAAACAGA
CTTTGA
 
Protein sequence
MSVVSPMLWL ECQQEQVEVS TAAQVAGLVA VLRRVPDWRD PRGVRYELAP VLALWVAGNI 
AGHDTTVAVW EWACALPVGV LAGLGFPRRV PSERTIRRIV EEGPPGQLDQ ALSGWTAAVA
PGPADPGGLV AVAFDGKVMK GARSRPPQGS VRQEAVVEAV RHDTGTALGH QRVVAGDEIA
SVRRLVNRVC DHNTLVTTDC LHAHEPLARA IRAKGGHWLF SIKGNQPTVR AKLAGLPWDE
FGNQHVTREK AHGRIEERAL KALTPSAPSL VGFRGTRQVV KLARTTRRKK TTTSPAATST
EEFYLVTSLS TDQASPAQLA RWARGHWTVE AIHHVRDRTM DEDRHTIRTK NAALNWAIAR
DTTISALRLA GYKNIRQARR ATIRDPGLVL QIIALTSQNR L