Gene Franean1_1851 details

Gene Information

Locus tag: Franean1_1851
Symbol:
ID: 5670253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2222811
End bp: 2223965
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 70%
IMG OID: 641240772
Product: IS605 family transposase OrfB
Protein accession: YP_001506195
Protein GI: 158313687
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
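
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the convention the listed gene length implies) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2222811, 2223965)
print(length_bp)                  # 1155
print(protein_length(length_bp))  # 384
```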


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0177268
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTCAACG ACGCGATCCG GGCCCGCGAC GAGGCGTACA AGGCCGGCGA GAAACTGTCG 
GACACCGAGG TTCAGCGCCG GGTGGTCACC CTCGCGAAAC TCACCGACGA GCGAACCTGG
CTGTCCGAGG TGTCGTCGGT GGTACTCGTG CAGGCGTGCC AGGACGCACG CCGGGCGTTC
CGGAACTGGT TCGACTCGCT GTCCGGGAAG CGGAAAGGCC GGCAGGTCGG CCATCCGCGG
TTCCGGTCAC GGAAGGACAA CCGGCAGTCG ATCCGCCTCA CCCGCAACGG CTTCACCGTC
ACGCCCCGAG GGGTGCGGGT GGCGAAGGTC GGAGATCTGC GGCTGGCCTG GTCGCGTCCG
CTGCCCTCGG TTCCGACGTC GGCGACGGTG ATCCGGGAGG CGGACGGCAG GTACTACGTG
TCGTTCGTCG TCGACGTCGA CGACGTCCCC TCCCCGGCGA CAGGCGCCGA GATCGGCGTC
GACCTCGGGT TGGACCGGCT CGCGACCCTG TCAACCGGAC AGATCGTCGC GAACCCGCGT
CCTCTGCGGT CGCGTCAGCG CAGGCTCGCC CGCGCACAGC GGGCACTGGC CCGCAAGCGG
AAGGGTTCGG TGAACCGGCG CAAGGCGGTC CGCCGGGTCG CGGTCGAACA TCGGAAGGTA
CGGGACACCC GCCGGGATCA TCATCACAAG CTCGCTGCTC GGCTGGTCCG CGACAACCAA
GCGGTCTACG TCGAGGATCT GGCGGTAGCC GGGCTGGCTC GTACGCGGCT GGCCCGGTCG
GTGCACGACG CGGGCTGGTC GATGCTGGTC GGTCTGCTCG AGGAGAAAGC GGCCCGGTGT
GGCCGGGCCG TGGTGAGGGT GGGCCGGTTC TTCCCGTCGT CGCAGGTCTG CTCGGCCTGC
GGCCACCGGG ACGGCCCGAA GCCTCTCCAG GTCCGGACGT GGACCTGTCC GGGGTGCGGT
GTCAGCCACG ACCGGGACCT GAATGCCGCG CGGAACATCC TCGTCGAGGG TCAGCGCCTG
GTCGCCGCCG GGCGGAAAGG CGTGGCTGCA ATGCCACGTC AGGCGGAGAC CGTAAACGCC
TGCGGAGCCG ACGTGAGACC CGGACCCCTC CGGGCAGCTG GCTGTGAAAC AGGAACCCAC
CGAGGTGCCG CGTGA
 
Protein sequence
MFNDAIRARD EAYKAGEKLS DTEVQRRVVT LAKLTDERTW LSEVSSVVLV QACQDARRAF 
RNWFDSLSGK RKGRQVGHPR FRSRKDNRQS IRLTRNGFTV TPRGVRVAKV GDLRLAWSRP
LPSVPTSATV IREADGRYYV SFVVDVDDVP SPATGAEIGV DLGLDRLATL STGQIVANPR
PLRSRQRRLA RAQRALARKR KGSVNRRKAV RRVAVEHRKV RDTRRDHHHK LAARLVRDNQ
AVYVEDLAVA GLARTRLARS VHDAGWSMLV GLLEEKAARC GRAVVRVGRF FPSSQVCSAC
GHRDGPKPLQ VRTWTCPGCG VSHDRDLNAA RNILVEGQRL VAAGRKGVAA MPRQAETVNA
CGADVRPGPL RAAGCETGTH RGAA
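
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch in plain Python, assuming the standard NCBI table 11 codon assignments and that the first codon (GTG in this gene) is read as methionine because it initiates translation. `translate_cds` is a hypothetical helper, not part of any library, and the example input is only the first 30 bases of the gene sequence.

```python
from itertools import product

# NCBI translation table 11: amino acids listed for codons in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    if residues:
        # Assumption: the first codon is the initiation codon, read as Met.
        residues[0] = "M"
    return "".join(residues)


first_30_bases = "GTGTTCAACG ACGCGATCCG GGCCCGCGAC"
print(translate_cds(first_30_bases))  # MFNDAIRARD
```

Passing the full gene sequence in place of `first_30_bases` allows the result to be compared with the protein sequence listed above.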