Gene Franean1_5114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5114 
Symbol 
ID5673449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6124272 
End bp6125657 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content71% 
IMG OID641243965 
ProductIS605 family transposase OrfB 
Protein accessionYP_001509379 
Protein GI158316871 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.279407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.776114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGTTC TCCAGGCGTA CCGGTTCGCA CTCGACCCGA ACCAGGCGCA GCTTGCCGAC 
CTGCGGCGTC ATGCCGGGGC GGCCCGGTTC GTGTTCAACT GGGGTCTGGC CCGGGTGAAA
GCCGCCCTGT CGCAGCGAGA CGCCGAGCAG TCCTACGGCC TGACCGGGGA CATGCTCACC
CCGGTGCCGT GGACGCTGCC CGCGCTGCGC CTCGCCTGGA ACGAGGCCAA GAGCACCGTC
GCCCCGTGGT GGTCGGCGTG CTCGAAGGAG GCGTACTCGT CCGGCCTCGA CCAGCTCGCC
CGCGGGTTGA AGAACTTCAC TGATTCCCGC ACGGGGAAGC GGAAGGGGAA GCGGGTCGGT
TTCCCCCGGT TCAAGAAGCG TGGCAGGGCC CGCGACTCGT TCCGGTACAC GACCGGCGCC
TACGGCCCGG CCAGCAACCT ACAGGTGAAG CTGCCCCGCC TGGGCCGGGT CAAGGTCCAC
GAGGCGATGG GTGCGCTCAC CGGCCGGTTG GCGGCCGGGT CCGCCCGGCT GCTCGGTGCG
ACCGTGTCGC GCACGGCGGG CCGCTGGTTC GTGGCGTTCA CCGTCGAGAT CGATCGTGAG
ATCCCCCAGA ACCCGTCGGC CCGCCAACGT GCGGGCGGCG CGGTCGGTGT CGACGCGGGA
GTGAAGCACC TCGCTGTCCT GTCGACCGGT GAGCAGGTCG ACAACCCCAG GCCACTCACC
CGCTCGCTGC GCAGGCTGCG CACTGCGTCC CGGGCCTGTG CCCGCTCGAA GCCGGGCAGT
GCCGGCCGCC GCCAGCGCGC TGCCGCACTG GGCCACCTGC ACGCCCACAT TGCCCACCAG
CGGCGCGATG GGCTGCACAA GCTCACCACC CGGCTCACGA AGAACCACGA CGTGATCGTG
GTCGAGGATC TGCATGTCGC CGGGATGGTC CGTAACCGCC GGCTCGCCCG CGCGGTCTCG
GACACCGGGA TGGCCGAGAT CCGGCGCCAA CTCACCTACA AGACCGTCTG GTACGGATCG
CGGCTCGTCG TCGCGGACCG GTGGTATCCG TCGAGTAAGA CCTGTTCCGG CTGTGGCTGG
CGAAACCCAA GCCTCACCCC GGCCGACCGC ACGTTCGCCT GCCAGTCCTG CGGGCTGGTG
ATCGACCGCG ACCTGAACGC CGCGATCAAC CTGCGCAACC TCGTCGCCGC CAGTACGTCG
GAGACGGAAA ACGCCCGTGG AGCCGACCGT AGGACCCAGC CTGCTGGGCG GGTGGCTGGG
AAGCGGGAAC CCGGCACCGT GGGCATGACC CCGTGCGAGC GGGTCAGACC GGGGGTGCCT
CACCGAGAGG CGAGGCGGCA TGACCGGGCG CTACCAGGCG CTCACATGCA ACGGCGGTGG
CACTGA
 
Protein sequence
MKVLQAYRFA LDPNQAQLAD LRRHAGAARF VFNWGLARVK AALSQRDAEQ SYGLTGDMLT 
PVPWTLPALR LAWNEAKSTV APWWSACSKE AYSSGLDQLA RGLKNFTDSR TGKRKGKRVG
FPRFKKRGRA RDSFRYTTGA YGPASNLQVK LPRLGRVKVH EAMGALTGRL AAGSARLLGA
TVSRTAGRWF VAFTVEIDRE IPQNPSARQR AGGAVGVDAG VKHLAVLSTG EQVDNPRPLT
RSLRRLRTAS RACARSKPGS AGRRQRAAAL GHLHAHIAHQ RRDGLHKLTT RLTKNHDVIV
VEDLHVAGMV RNRRLARAVS DTGMAEIRRQ LTYKTVWYGS RLVVADRWYP SSKTCSGCGW
RNPSLTPADR TFACQSCGLV IDRDLNAAIN LRNLVAASTS ETENARGADR RTQPAGRVAG
KREPGTVGMT PCERVRPGVP HREARRHDRA LPGAHMQRRW H