Gene Franean1_0633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0633 
Symbol 
ID5669050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp733839 
End bp735578 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content79% 
IMG OID641239560 
Producthypothetical protein 
Protein accessionYP_001504998 
Protein GI158312490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00895594 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.470302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTTCTG GTCGTGCCTG GCGACCCGTC GCCGGCCTCC GCCCGGCCAC CCTCCAGCCG 
GGCGGAGGCC TCGGCAGCCC CGATCGCAGC CCCGACGCAC CAACCAGCCA CGCCCCGAGC
CCGAACCGCC CGGCCCGGGC GGTTCGCCGC CCGCGAAGAT CTGCCCGCGG CGAACCGCCC
GCGGAGCCGC GACGCCCCTG GAGGTCCACG ATTCCCGGCA GTTGGCGGCG CCTCCCGTGG
CGGCCCTCTC GGCAGTTCGC GTGGCGGTTC CTCAGGCCGG CACGCGGCGG GCGTGCCCGC
GCCGGGCTGG GCGTGCTGTT TCTCTGCGCC GCGGCGGCCC TCGTCGCGAC CGGTCGCGGG
GCCGCACCCA CCGAGCTCGA GCTCACCGAC GGCGGCGTGT GGCTGGCCAC CACCAGCACA
GGCACCCTCA CCCACCTGAG CGGACCAGCC GGGCGGGCCG ACGCCGCGGT CACCGTGCCC
GGCGCCGTCG GACGTGACCT GACTGTCGCC CGCACCGGCG CGGCGGTCCT CGTGGCCGAC
CCCGGTTCGG GCCAGGTGCA CCTGGTCGAC CCTGCCCGGC TGGCCTCCGT CCGCTCGGCG
GATCTGGGGC CCGGGGTGAC GATCGTCACC TCGGCGACGG CGGCCTACGC GGTCGACCCG
GCGTCCGGGC GGGTCCGGCG GCTCACCCGC GACGACCTCG CCGGCGCCGG CCCCGTCCTG
GAGCTCCCGC CGCCGCTGGG GCGCGCGGCG CTGACCGACG ACGGAACCCT GTGGGTGCCG
GTCCGGTCCG CGGGCACAGT CGTCGCCCTG CGGGACGGCG CCGCCGAGCC GCCGCGCCCG
GTCGCCCCGC CCGGCAACGC CGTCGACGTA GTCCTCGCGG GCGGGCACCC GCTCGCCGTG
GACACCACCG CCGCCACCGT CACCGCCCCC GACACGGGAC GCGTGATCGC CCTGCCGCCC
GCCGGCCCGA CCAGCGGACC GCTCCCCGGC CTGCTGGCGC CGCCGCGCAC GGACGGCGGC
CCCGTGCCCC TCCTCGACCC GGCCACCCGC CGCCTGTTCC TCGTGGACGT CGATCAGGGC
TTGGCGACCA CAGTTACCAC GGTGACCATC CCGGACATCC CCGGATCGGG CCAGCTCGGC
ACACCGGTGG TCCATGCCGG TCACGGGTAC GTGCCGGACT CCGCGCTCGG CGTGGTGCTC
GACTACGACA TCGCCCGCGG CGGCTTCGGC GATCCGGTGC CGGTCGCCGC GCCCGCCGAC
CAGCCGCGGC TCACGGTGGC CGTCGACGGC GACCTGGTGT GGATCAACGA CCTGGCCGGG
CCGAACGCCG TCCTCATCGA CGGCCGGGGG CGGACGGCGA TCGCCAAGCA GCCGCCGGAT
CTCGCCGGTC TGGCCACCGA GGCGGCCCGC CCGCTCCCGC CGGCGCCGCC GCTGCCGACG
GCAGGGCGGC CGTCGGCGCC CGGCCCGGCC CGCGATCCGG GGCCGGCCAC GCCGACCGCA
CCAGCCGCGC CGACGGGGAC GGCCGCGCCG CGGACCACCT CCACCGGGCC GCCCACCCCG
CCCGGACGGA CCACGCCCGA GCCGACCATC GTGCCGCCAC CCACTCCGCC CCCGCCGACC
GGACCGCCAC CGACCGGACC ACCGCGGACG GCACCGCCGG ACGGCACGCC GCCACCGCTC
CCCTCCCCGA CCGTCGCGCC CCGGCCGACC ACCACACCGC CCGGGACCGA CCTCGTGTGA
 
Protein sequence
MSSGRAWRPV AGLRPATLQP GGGLGSPDRS PDAPTSHAPS PNRPARAVRR PRRSARGEPP 
AEPRRPWRST IPGSWRRLPW RPSRQFAWRF LRPARGGRAR AGLGVLFLCA AAALVATGRG
AAPTELELTD GGVWLATTST GTLTHLSGPA GRADAAVTVP GAVGRDLTVA RTGAAVLVAD
PGSGQVHLVD PARLASVRSA DLGPGVTIVT SATAAYAVDP ASGRVRRLTR DDLAGAGPVL
ELPPPLGRAA LTDDGTLWVP VRSAGTVVAL RDGAAEPPRP VAPPGNAVDV VLAGGHPLAV
DTTAATVTAP DTGRVIALPP AGPTSGPLPG LLAPPRTDGG PVPLLDPATR RLFLVDVDQG
LATTVTTVTI PDIPGSGQLG TPVVHAGHGY VPDSALGVVL DYDIARGGFG DPVPVAAPAD
QPRLTVAVDG DLVWINDLAG PNAVLIDGRG RTAIAKQPPD LAGLATEAAR PLPPAPPLPT
AGRPSAPGPA RDPGPATPTA PAAPTGTAAP RTTSTGPPTP PGRTTPEPTI VPPPTPPPPT
GPPPTGPPRT APPDGTPPPL PSPTVAPRPT TTPPGTDLV