Gene Franean1_5918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5918 
Symbol 
ID5674239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7189343 
End bp7190542 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content71% 
IMG OID641244766 
ProductIS605 family transposase OrfB 
Protein accessionYP_001510168 
Protein GI158317660 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.320804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACGTT CCTTCAGGTT CCAGCTTCGC CCGACCGCCC GGCAGGCCGC GGCGCTGAGC 
GTGATGCTCG GCGACCACCG GGCGCTGTAC AACGCCGCGT TGCAGGAACG CCGCGACGCC
TGGCGTCACC CGTCGAAGAC CACGATCCGC TACGGCGACC AGTCCGCCCA GTTGAAGGAG
ATCCGCGCCT GTGACCCGGA CCAGGGCCGC TGGTCGTTCT CCTCTCAGCA GGCCACCCTG
CGCCGGCTCG ACAAGGCGAT GGCCGCGTTC TTCCGCCGGG TCAGGGCGGG CGCGACGCCC
GGCTACCCGC GGTTCAAGGG CGCGGGCCGG TTCGACACGG TGGAGTGGCC GAAGGACGGT
GATGGTTGCC GCTGGGACTC CCAGCCCGGG CATCCCGCCC AGACCCGGGT CCGACTGCAG
GGCATCGGAC ATGTCAGGGT CAACCAGCAT CGGCCCGTGG CCGGCACGGT CAAGACGATC
AGCCTGAAGC GGGAAGGCCG CCGCTGGTAT GTGCTGTTGT CCTGCGACGA CGTGCCCGCT
GAGCCCTTCC CGGCCACCGG GGTGGTGGTC GGGGCGGACC TGGGTGTGGC GTCGCTGGTC
ACCCTCTCCG ACGGCCGCCA CGTCGGGAAC CCCCGCTACC TCGCGGCGGC GGCCGGCCGG
CTCGCGCGGG CGCAGCGGGA ACTGGCCCGC AAGAAGCGTG GGTCCACCCG GCGCCGGAAG
GCGGTCGCCA CGGTCGCGGC GCTGCACGGC ACGGTGCGCC GCCAGCGACT CGATCTCGCC
CACAAGGCGG CCCTCAGGCT GGTCCGTGAG CATGATCTGA TCGCCGTCGA GGCGCTGAAG
GTCACCAACA TGACCCGCAG GGCCGAACCG AAGCCCGACC CTGACCAGTC GGGGGCGTTC
CTCCCGAACG GTCAGGCCGC CAAATCCGGG CTGAACAAGT CGATCCTTGA CGCGGGATGG
GGGGTGTTCC TCGCCGTGCT GCGCGCCAAG GCTGAAAGTG CCGGACGGGT GGTCGTCGAG
GTCAACCCCG CCCACACCTC CCGCACCTGC GCGGCGTGCG GGCACTGCCA CGCCGACAAC
CGCAGAACAC AGGCCGCGTT CACCTGTGTC GCCTGCGGAC ACGCCGCGCA CGCCGACGTG
AACGCGGCGG TCAACATTCT TCGGGTCGGG CTGGCCCGTC AGGCCGCGGA AGCGGCCTGA
 
Protein sequence
MRRSFRFQLR PTARQAAALS VMLGDHRALY NAALQERRDA WRHPSKTTIR YGDQSAQLKE 
IRACDPDQGR WSFSSQQATL RRLDKAMAAF FRRVRAGATP GYPRFKGAGR FDTVEWPKDG
DGCRWDSQPG HPAQTRVRLQ GIGHVRVNQH RPVAGTVKTI SLKREGRRWY VLLSCDDVPA
EPFPATGVVV GADLGVASLV TLSDGRHVGN PRYLAAAAGR LARAQRELAR KKRGSTRRRK
AVATVAALHG TVRRQRLDLA HKAALRLVRE HDLIAVEALK VTNMTRRAEP KPDPDQSGAF
LPNGQAAKSG LNKSILDAGW GVFLAVLRAK AESAGRVVVE VNPAHTSRTC AACGHCHADN
RRTQAAFTCV ACGHAAHADV NAAVNILRVG LARQAAEAA