Gene Franean1_4789 details

Gene Information

Locus tag: Franean1_4789
Symbol:
ID: 5673130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5716390
End bp: 5717370
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 66%
IMG OID: 641243645
Product: IS605 family transposase OrfB
Protein accession: YP_001509061
Protein GI: 158316553
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
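
The start and end coordinates, gene length, and protein length listed in this record are mutually consistent, and a little arithmetic shows why. The following is a minimal Python sketch. It assumes inclusive coordinates and that the last codon of the CDS is a stop codon; neither is stated explicitly in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 5716390, End bp 5717370.
length_bp = gene_length(5716390, 5717370)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

Under these assumptions the result agrees with the listed 981 bp gene length and 326 aa protein length.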


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.232227
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTTCTC GGGTAGTGAA GCGGGCATAT CGTTATCGCT TCTACCCGAA TCCCGAGCAG 
GTCGCGCAGC TGGCTCGTAC CTTCGGCTGT GTCCGCTACG TCTACAACCG GGCGCTGGCC
GAACGGTCCC GCGCCTGGAC CCAGGAGCAG CGCAGGGTCA CGCACGCGGA GACCGACAGG
ATGCTCACCG TGTGGAAGCG GGACACGGAG ACGGCGTGGC TGGCCGAACC GTCGAAAGGG
CCACTGCAGG CCGCGCTGCG CCATCTGCAG GCGGCGTTCG TGAAATTCTG GGAAAAACGG
GCCGGATATC CGTCTTTCAA GAAGAAGGGC AGGAGCCTCG ATTCGGCGAC CTATTTCCGG
AACTGCTTCA CCTACCGGAA CGGGAACATC ATACTGGCCA AGCAGGACCG GCCGTTGGAC
ATCGTCTGGT CGCGTCCGCT GCCCGACAGC GCAGCGCCCT CGCGGGTGAC GGTGTCGCGG
AACGCCCGCG GCCAGTACCA CGTCTCGATC CTGGTCGAGG ACACCGTCAC CAGCCTCCCG
CCGGCTGGGG GGCAGGTCGG GATCGACGCG GGCATCACAG CGCTGGTCAC CTTGTCGACC
GGGGAGAAAG TCACCAACCC CCGGCACGAG CGCCGCGACC GTGTCCGGCT GGCGCGGGCG
CAGCGGGACC TGTCCCGCAA GGCGAAGGGC TCGGCGAACC GGGTGAAGGC CCGCGTGAGG
GTCGCGGAGA TTCACGGCAG GATCGCGGAT CGGCGCCGGG ATCATCTGCG CAAGCTGTCC
ACGAGGATCA TCCGCGAGAA CCAAACGGTG GCCGTCGAGG ACCTGTCCGT CCGCACCATG
GTCCGCAACC ACTCGCTGGC CCGTGCTATC TCCGACGCAT CCTGGTCGGA GTTGCGGGCG
ATGGTGGAGT ACAAAGCCGA CTGGTACGAC GGAACATTCT GGCCTCGGGG CTCGCGGTGT
CCGCCTGTGG AGATGGAGTG A
 
Protein sequence
MGSRVVKRAY RYRFYPNPEQ VAQLARTFGC VRYVYNRALA ERSRAWTQEQ RRVTHAETDR 
MLTVWKRDTE TAWLAEPSKG PLQAALRHLQ AAFVKFWEKR AGYPSFKKKG RSLDSATYFR
NCFTYRNGNI ILAKQDRPLD IVWSRPLPDS AAPSRVTVSR NARGQYHVSI LVEDTVTSLP
PAGGQVGIDA GITALVTLST GEKVTNPRHE RRDRVRLARA QRDLSRKAKG SANRVKARVR
VAEIHGRIAD RRRDHLRKLS TRIIRENQTV AVEDLSVRTM VRNHSLARAI SDASWSELRA
MVEYKADWYD GTFWPRGSRC PPVEME
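
The sequences in this record are printed in blocks of 10 characters, so the spacing has to be stripped before computing lengths or base composition. The following is a minimal Python sketch of that cleanup, together with a simple GC fraction. The helper names are hypothetical, and the database may compute its reported GC content by a different method (for example, with different rounding).

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Example using only the last printed line of the gene sequence above.
last_line = "CCGCCTGTGG AGATGGAGTG A"
print(len(clean_sequence(last_line)))
print(round(gc_fraction(last_line), 3))
```

Pasting the full gene sequence into `clean_sequence` allows its length to be compared with the listed gene length, and `gc_fraction` gives a value to compare with the listed GC content.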