Gene Franean1_2794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2794 
Symbol 
ID5671183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3304957 
End bp3306213 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content72% 
IMG OID641241703 
Producttransposase IS4 family protein 
Protein accessionYP_001507123 
Protein GI158314615 
COG category[L] Replication, recombination and repair 
COG ID[COG5659] FOG: Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTGA GTGAGATGGA CCGGGTGCGG CCGGTGATCG AACGGTTCGC GGGTGAGATG 
TTCGCGGATC TGCCGCGGCG GGATCAGCGG GGCAAGGGTG AGCTGTATGT GCGGGGGCTG
CTGACCGACG GCAAGCGCAA GAGCATGGTC CCGATGGCCG CCCGCCTCGG CGTGGACCAT
CAGCAGCTAC AACAGTTCGT GACCAGCTCG ACCTGGGACT ACCGCCAGGT GCGGCGGCGG
CTGACGGGCT GGGCGACCGG GTTCCTCGAC CCGGTGGCGC TGGTGGTGGA CGACACCGGT
TTCCCCAAGG ACGGGCCGGC CTCGCCCGGG GTGGCCCGGA TGTACTCCGG CACCCTGGGG
AAGGTCGGGA ACTGTCAGAT CGGGGTGTCG GTGCACGCGG TGACCGACTG GGCGTCGGCC
GCGGTGGACT GGCGGCTGTT CCTCCCGGCC TCCTGGGACG ACACCGCCCT GTCCGACCCG
CAGGAGAGCG CCGCCGCGCG GGCCCGGCGG GCACACGCGG GGGTCCCGGA CGAGGCGCGG
CACCGGGAGA AGTGGCGGCT GGCCCTGGAC ATGATCGACG AGCTGGCCGG CTGGGGTATG
CCCGTCCGGC CGGTGGTCGC GGACGCCGGC TACGGTGACG CCGCCGAGTT CCGCCAGGGC
CTGACCGACC GGAACATCCC CTACGTGCTG GCGGTGAAGC CGACCGCGAC CGCCTACCCC
GCCGACGCCA CGCCGGTCAC CGCCCCGTAC TCCGGGAACG GCCGTCTGCC CGTGCCCGCC
TACCCCGACC CACCCCGGGA TCTGAAATCC CTGGTCATGG CCGCCGGCCG CCGCGCGGGC
CGGTACGTGA CCTGGCGTCA CGGCACCCAC AAGACCCCGG ACAACCCGAC CGCAGGGATG
CGCTCCCGCT TCCTCGCACT CCGGGTCCGC CCCGCGGGCC GGAACATCAC CCGCAAGTCC
GACCGGAGCC TGCCGGACTG CTGGCTGCTG GCCGAATGGC CCCCCGGCCA GCCCGAGCCC
ACCGACTACT GGCTGTCCAC CCTGCCCACC GAGATCCCGA TCCGCGACCT CGTCCGTCTC
GCGAAGATCC GCTGGCGGAT CGAACACGAC TACCGCGAAC TCAAAGACGG CCTCGGCCTC
GACCACTTCG AAGGCCGGAC CTGGACCGGC TGGCACCACC ACGTGACCCT CGTCAGCATC
GCCCAAGCCC TCTGCACCCA GCTGAGACGA ACCCCAAAAG TCCCTGCGCC GGCCTGA
 
Protein sequence
MELSEMDRVR PVIERFAGEM FADLPRRDQR GKGELYVRGL LTDGKRKSMV PMAARLGVDH 
QQLQQFVTSS TWDYRQVRRR LTGWATGFLD PVALVVDDTG FPKDGPASPG VARMYSGTLG
KVGNCQIGVS VHAVTDWASA AVDWRLFLPA SWDDTALSDP QESAAARARR AHAGVPDEAR
HREKWRLALD MIDELAGWGM PVRPVVADAG YGDAAEFRQG LTDRNIPYVL AVKPTATAYP
ADATPVTAPY SGNGRLPVPA YPDPPRDLKS LVMAAGRRAG RYVTWRHGTH KTPDNPTAGM
RSRFLALRVR PAGRNITRKS DRSLPDCWLL AEWPPGQPEP TDYWLSTLPT EIPIRDLVRL
AKIRWRIEHD YRELKDGLGL DHFEGRTWTG WHHHVTLVSI AQALCTQLRR TPKVPAPA