Gene Franean1_0852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0852 
Symbol 
ID5669268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp998673 
End bp1000499 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content79% 
IMG OID641239781 
Producthypothetical protein 
Protein accessionYP_001505216 
Protein GI158312708 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00483421 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.421013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACGG CCTCGTCGGC GGGTGCGCGC GGCGGCGCGG GACGGCGGAG CGGGCGCGAC 
CGGCCCGCGA GCGCCGGCGC GGCCACCACC GGCACGACCA CCGCCGCAGC CGCGGCCGTC
ACCACGGTGA CGGCCGCGCT GCTCGCGGCC GCCTTCTGGG CCTGGGCGGC ACACCGGGTC
CCGGCGGTAG ACGTCGCCGC GGACGCCACC GTCGTGCTGG GTCTGCGGGT CATCGTCTCG
CTCGCGCGCG CCGCCGCGCC CGGACCCGGC GCGCGGCACC GGCTGCGCGC CGGAGCCCTG
GCCGCGCTCG GAGCGGTCAC CCTGGCCTGG GCCGGTGGAT CGCTGATCCC AAGCCTGTCC
TTCCTCGCCG CCACCCCCGG TTTCCTCATC CTCCTGCCGC TGGCGGCCGC CAGCGTCGCC
CTCTGGCCGG CGCGCCCCGA CGTGTGGCGG TTCGACGCGT CCCAGGACTC ACCGAACCGC
CGCACATCGG CGGCCGGGAT GTTGGTCGCC GCGGGGGCGT GGGTGGCGCT GGGCGCCGTT
CGGGTGGGCG CGCTGGTGTT CGCCGCGGAC GTCGCGGCGC GGGGCGGGGC GGGGCATGCG
CACGTCGCCC TGCTCGCATG GCTGGCGCCG GTCGCCCTGG CGGCCGGTCC ACGCTGCGGA
TGGCTGCTCT GGCGGACGAA CGCGGACCCC GGCCTCGCCT CATCCCGTGT GGCCACCCCG
GCGGGGCGGC AGCGCCCCGT CCGGCGGCTC GGCGCCCCCC GGCGGACGAC GACGACGGCG
CGAGCCGTCC CCGCGCCGGA GTGGGAGCCG CCGGCTCCCG GGTCACGCCC GGGCCCCGGA
TCGCTCCCAG CCCGCGGATC GCAACCCGCT CCGGGATCGC TCCCGACCCG CGGCTCGCCA
CCAGCTCCCG GCTCGCTCCC CGCTGCCGGG TCACCGTCGG AGCGAGGATC AGTGGTCCGG
CCCAGCGCGC AGCGGTGGGC GCAGGGCCCC CAGCGACCGC CCAGGTCACG GCGCCCCTCG
CTACCACCGA CGAGGTCGCC GCAGGCGCCG ACCCCGCGGT CACCGTTCTC GGCACTGCCC
ACAGCTCCGC CCAGCAGGAC GCCGTTCGGC AGGACGCAGG TGGATGAGGC GGCGCGGCGA
TCGCTCACGT CACTCGCGGC CGAGCCGGCC GCGCCGGAGC CTGCGACGCG GGTGCTCTGG
TCGGCGGCGG CGGCCGGACT GGCCCTGGTC GTCCTCGATC ACCAGGTCGG CCCGGCGAGC
GCCGCCTTCG CCCTGGCCGT GCTCGCCATC CTGGCCGTCG GCGGGTTCGT CGGCGGCCGT
GGGGCCGCGA CGCTGCCGTG CGTGGCCGTC GCGCTGGTCG CCGCGCCGCT GCCGCTGGCC
GTGCTCGGCG ACGAGTACGC CGATCAGGGA ACAGCCGGGC TCCGCCTGCT CCTGGCCGCC
CTCGCGATCG ACGCCCTGCC AACCGGCGCC CAGCCGTCCA GCGGGGTGCG GGCGGCCGCG
CGGCCGGTGA TCCGGCGCGG GGCCACGGTG GCTGGGCTGA CCGTTCTGGT GGCCATACTC
CCGATCATGC TGGCGGCCTG GGGTGCCGAG GGGGCCGCGC TCGCGCTGCT GCTCGGCCGG
GTGGTCGCCG TGGCGCCTGT GGTCCGCCTG CCGGCGCGCC GCGAGCGCGC CGCCGAGCCG
GCCGCCGAGG CCCCCGGGCA ATCCCGGCCC GCCGGAGCCG CGCCGGCCGC CGTCGACCGC
CGCCCAGAGG CGGCCGGCCG CGGCGGGAGA CGCGCCCGGA TGGCCAGGTT CGCGCCGGGA
GTCAGTCACC CATCTCGGTC ACTGTAG
 
Protein sequence
MTTASSAGAR GGAGRRSGRD RPASAGAATT GTTTAAAAAV TTVTAALLAA AFWAWAAHRV 
PAVDVAADAT VVLGLRVIVS LARAAAPGPG ARHRLRAGAL AALGAVTLAW AGGSLIPSLS
FLAATPGFLI LLPLAAASVA LWPARPDVWR FDASQDSPNR RTSAAGMLVA AGAWVALGAV
RVGALVFAAD VAARGGAGHA HVALLAWLAP VALAAGPRCG WLLWRTNADP GLASSRVATP
AGRQRPVRRL GAPRRTTTTA RAVPAPEWEP PAPGSRPGPG SLPARGSQPA PGSLPTRGSP
PAPGSLPAAG SPSERGSVVR PSAQRWAQGP QRPPRSRRPS LPPTRSPQAP TPRSPFSALP
TAPPSRTPFG RTQVDEAARR SLTSLAAEPA APEPATRVLW SAAAAGLALV VLDHQVGPAS
AAFALAVLAI LAVGGFVGGR GAATLPCVAV ALVAAPLPLA VLGDEYADQG TAGLRLLLAA
LAIDALPTGA QPSSGVRAAA RPVIRRGATV AGLTVLVAIL PIMLAAWGAE GAALALLLGR
VVAVAPVVRL PARRERAAEP AAEAPGQSRP AGAAPAAVDR RPEAAGRGGR RARMARFAPG
VSHPSRSL