Gene Franean1_4639 details


Gene Information

Locus tag: Franean1_4639
Symbol:
ID: 5672982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5532347
End bp: 5533531
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 70%
IMG OID: 641243497
Product: transposase IS4 family protein
Protein accession: YP_001508913
Protein GI: 158316405
COG category: [L] Replication, recombination and repair
COG ID: [COG5659] FOG: Transposase
TIGRFAM ID:
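
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 5532347
end_bp = 5533531


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length(start_bp, end_bp)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1185 394
```

Both results agree with the Gene Length and Protein Length fields of the record.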


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGGGG TCGTGGCAGC GGAGGATGTC GTCGGGTGGG AGCGGGAGCT CGCGGCGTCG 
ACGGACGGGC TGGGTGGGTT GTTCAACCGG CCTGAGCCCA GGCGTGTGTT CGGTGACTTC
GTGCGGGCGC TGCTGGCGGA CGTACCGAAG AAGAACTCGT GGGGGCTGGC CGAGCATGCG
GGTTATGCAA CGCCGCGGCC GTTCGAGCAT CTGCTCGACG GGGCTGTGTG GGACGCCGAT
CTGCTGCGCG ACGCGGTGCG GGAGTTCGTG GTCGACCGGC TCGGGTCGCC GGTGGGTGTG
CTGGTCGTCG ATGACACGCA GGCGTTGAAG AAAGGTGACA AGTCGGTGGG GGTGGCTCCT
CAGTACTACG GGCTGACCGG GGACGTCGCG AACGTGCAGA CCATGGTCAT GTGTACCTAT
GCCTCGCCGG CCGGGCACGC GTTCGTGGAC CGGGAGTTGT ACCTGCCCGA GGTGTGGACC
AGCGACCCGG CCCGCTGCCG GGCGGCCGGC GTGCCCACCG ACCGACAGTT CGCCACGAAA
CCCCAGCTCG CGGTGGCGAT GCTGACCCGG GCGGTCGACG CCGGGGTGCC GTTTCGCTGG
GTCGTCGCCG ACAGCGGCTA CGGCAAGGAC GCCCGGCTGC GGGGGTTCTG CCACGACCGG
GGGCTGTCCT ACGTGCTGGC CGTCCCGAAG AACCTCGCCC TCCTCGACGC CCGGGGCCGG
CCGACCCGCC CGGACCGGTT ACACGCCCGG CTGCCCGTGG GAGTGTTCGA GCGCCGTTCG
TGCGGCGCCG GGTCGAAAGG CGCCCGCTGG TATGACTGGG CCGCCCACGC GGTCACCGTC
GCCGGAGAGG ACCCGGCCAG CGGGCACGCT CACACCCTGC TGGTGCGTAA GTCCACCACC
CCGCGTACTC GTGACGGCAA GACCTTCTAC GACGTCGAGT ACTTCCTCGC CCACGCCCCG
ACCGCGACCG GCGTCCCCGA CCTGGTCGCC GCCGCCGGGA CGAGGTGGAC CATCGAGGAA
AACAACGGCC AGGGCAAGGA CGTCCTCGGT CTCGACCAGT ACCAGGTCCG GAAATGGACC
CCCTGGCACC GACACGTCAC CCTCAGCATG CTCGCCCAGG CGTTCCTCGC CGCGACCCGC
GCCAACCCGG GAAAAGACCC CCGCATCCAG GAGGCCACCA GCTAA
 
Protein sequence
MVGVVAAEDV VGWERELAAS TDGLGGLFNR PEPRRVFGDF VRALLADVPK KNSWGLAEHA 
GYATPRPFEH LLDGAVWDAD LLRDAVREFV VDRLGSPVGV LVVDDTQALK KGDKSVGVAP
QYYGLTGDVA NVQTMVMCTY ASPAGHAFVD RELYLPEVWT SDPARCRAAG VPTDRQFATK
PQLAVAMLTR AVDAGVPFRW VVADSGYGKD ARLRGFCHDR GLSYVLAVPK NLALLDARGR
PTRPDRLHAR LPVGVFERRS CGAGSKGARW YDWAAHAVTV AGEDPASGHA HTLLVRKSTT
PRTRDGKTFY DVEYFLAHAP TATGVPDLVA AAGTRWTIEE NNGQGKDVLG LDQYQVRKWT
PWHRHVTLSM LAQAFLAATR ANPGKDPRIQ EATS
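
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing might be cleaned up, translated, and checked for GC content follows. It uses only the Python standard library; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard genetic code, used here on the assumption that translation table 11 assigns the same amino acids to every codon and differs only in its permitted start codons.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence listing above.
first_line = "ATGGTTGGGG TCGTGGCAGC GGAGGATGTC GTCGGGTGGG AGCGGGAGCT CGCGGCGTCG"
dna = clean(first_line)
peptide = translate(dna)
print(peptide)  # MVGVVAAEDVVGWERELAAS
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.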