Gene Franean1_5526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5526 
Symbol 
ID5673856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6693904 
End bp6695292 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content73% 
IMG OID641244382 
Producttransposase IS4 family protein 
Protein accessionYP_001509786 
Protein GI158317278 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCAGA GTACCGGTGT TTACCCCTCT GTCCGCATCG ACGCCGCCGG GCGGGGTGTC 
GTGTCGGCGG CGGGCAGCGT GCTGCTGACC GAGACGGTCC GCGTCAGCGG GCTGGGTCGG
TTGCTGTCCG ACGCGCTCGC GCCGTGGCGG TCACCGCTGG CGACCCATGA CCCGGCGAAG
GTACTGACCG ATCTGGCCGT GGCGCTGGCG TTGGGCGGGG ACTGCCTGGC CGACATCGCC
CAGCTACGCG CGGAGCCGGA GTTGTTCGGG CTGGTGGCCT CGGACCCGAC GGTGTCACGC
ACGATCGACC GGCTCGCCGG CGACGCCGAG AAGGTCCTGA CGGCACTCGA CCGGGCCCGC
GCGGCAGCGC GGGGCCAGGT ATGGGCGGCG GCCGGGGAAC ACGCCCCCGA CCACACGGTG
TCGGTGCACG ACCCGCTGAT CGTGGATCTC GACGCGACAC TGGTCACCGC CCATTCCGAG
AAACAAGACG CGGCGCCGAC ATTCAAACGC GGCTTCGGGT TCCACCCGTT GTGGGCGTTC
ATCGACCACG GCCAGGCCGG CACCGGGGAA CCCGCGACGG TCCTGCTGCG CCCCGGTAAC
GCCGGGTCGA ACACCGCCGC CGACCACATC ACCGTCGCCA CCGGCGCCCT CGCCCAGCTC
CCGGCCCGGC TACGCCGCTC CCGCAAGGTA CTGATCCGCG CCGACTCCGC CGGTGGGACC
CACGCCTTCC TCGCCTGGGC GCACCAGCGC AGGCTGGCCT ACTCGGTCGG GTTCACCCTG
CCCGACAACG CCGCCCAGCT GATCGGCGAG ATCCCCGCGA AGGCGTGGAC ACCGGCCTAC
GACGCCGACC GCCAGCCCCG GCCCGGCGCG TTCGTCGCCG AACTGTCCGG GCTGATGGAC
CTGACCGGCT GGCCACCGGG CATGCGTGTC CTCGTCCGCA AGGAACGCCC GCACCCCGGC
GCGCAGCTGC GTGTCACCGA CGTCGACGGC AACCGGATCA CCGCGTTCGC GACCAACGCC
GCCCGCGGGC AGCTCGCCGA CCTCGAACTA CGCCACCGCC GCCGGGCCCG CTGCGAGGAC
CGTATCCGAA CCGCCAAGGA CACCGGTCTG ACCAACCTGC CGCTGCACGA CTTCACCCAG
AACCAGATCT GGTGCGCCCT TGTCGCACTC GCTCTCGATC TGATCGCCTG GACCCAGATG
CTCGCCCTCG CCGGCCAGCC CGCGCGACGC TGGGAACCGA AACGGCTACG CCACCGCCTG
TTCTGGCTCG CAGGCCGGCT CGCCCACCAC GCCCGCCGCA GCACCCTGCA CCTGCCCCCG
CACCACCCCT GGGCCCCCCT CGCCCTCCAG GCGATCACCA CACTGCGCGC ACTGCCCGCC
CCCGGATAG
 
Protein sequence
MVQSTGVYPS VRIDAAGRGV VSAAGSVLLT ETVRVSGLGR LLSDALAPWR SPLATHDPAK 
VLTDLAVALA LGGDCLADIA QLRAEPELFG LVASDPTVSR TIDRLAGDAE KVLTALDRAR
AAARGQVWAA AGEHAPDHTV SVHDPLIVDL DATLVTAHSE KQDAAPTFKR GFGFHPLWAF
IDHGQAGTGE PATVLLRPGN AGSNTAADHI TVATGALAQL PARLRRSRKV LIRADSAGGT
HAFLAWAHQR RLAYSVGFTL PDNAAQLIGE IPAKAWTPAY DADRQPRPGA FVAELSGLMD
LTGWPPGMRV LVRKERPHPG AQLRVTDVDG NRITAFATNA ARGQLADLEL RHRRRARCED
RIRTAKDTGL TNLPLHDFTQ NQIWCALVAL ALDLIAWTQM LALAGQPARR WEPKRLRHRL
FWLAGRLAHH ARRSTLHLPP HHPWAPLALQ AITTLRALPA PG