Gene Franean1_5109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5109 
Symbol 
ID5673444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6117602 
End bp6119296 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content77% 
IMG OID641243960 
Producthypothetical protein 
Protein accessionYP_001509374 
Protein GI158316866 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.344799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.106863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGACT TCCTCGCCGC GGTACGCGGG CTGACGGTCC GCGGTCGCTC GTTCCTCGCC 
GCGGGCGGGG CGTGCGTGGC CTCGGCAGCG GTCATCGGCG AGCAGGACCT CCTCGCGGTC
GGTGCCCTGC TTGTCGCGCT GCCGCTGTTC GCGGCCGGGT TCGTCGCGCG GACCCGCTAC
CGGCTGGCCT GCACCCGCCG GTTGGAGCCG CCGCGGGTCA CCGCGGGCGA CACCGTCTCG
GTACGGATCC GGCTGGACAA CGTCTCCCGA CTGCCGTCCT CGGTGCTGCT CGTCGAGGAC
GCGACCCCCA ACCTCGGCCA CCGCGCGCGC TTCGTGGTGG ACCAGATCGA ACCGGGCGGT
TCCCGTGACC TCTCCTACCC GCTGGGCGCC GGGGTCCGCG GCCGGTACCA GGTCGGCCCG
CTCACGATCC GGCTGACCGA CCCGTTCGGC CTGTGCGAGC TGGAGCGCAG CTTCCGGGGG
CGGGACGAGC TGATCGTCGC GCCCGCCCTG GAGCGCCTGC CGCTGACGCC ACTGGTCGGT
TCGTCCTCGC TCAACAACGA GGTACGCCGC TCGTCGGCAC GGGCGGGTGA GGACGATTCC
ACCACACGGC CATACCGCTC CGGCGACGAT CTGCGCAAGG TGCACTGGAA GACCACGGCC
CGACTGGGTG AGCTGATGGT GCGCCGCGAC GAGCGGCCGC TGACCGGCGC CGCCGCCGTG
CTGCTCGACA CCCGGCACGC GGCCTGGCCC GAGATGGACC GGGACGCCCC CTTCTCCTGG
GCGGTCGGCG CGGCCGGCTC GATCGCGGTC AACCTCGCCC GCAGCGGCTA CGGCGTCCGG
CTGATCGCCG ACACCGGTGT CGCGGCGACC GGTCCCGGCA ACGCCGTCGG CGCGCTGCTC
GACGAACTGG CGGTCATCGC GCCGACCCCG TCGGCCACCC TCAGTCCGGC GCTGGCCAGT
CTGCGCTCGG CCGAGCACTC CGGCATGGTC GTCGTCGTGC TCGGGCGCAC CGACCAGGCG
ACGGCCTCGA TGATCGCGGG TGCCCGGCCG CGCAACGCCC CGGCCATCGC GGTGCTGGTG
GATCTCGCCG GCTGGGGCAC CTCCCCGGCG GCCGGGGGCG ACCTGGAGGT CACCCGGCAC
ACCCTGACCA GGCACGGCTG GACCGTCCTG GTCGCGGGTG CCGGGGCCCG CCTCGCCGAC
ACCTGGCCGC AGATCTTCCG TCCGGGCGCG TCGGCCGGGC GCCGGTTCGC CGTCGGGAAC
GCCGTCGGCG GCATGGCGCG GGTGTCGTAC GGGTCCGCGT CGGGCCCCGG TCCGGCCGCC
CGCGCGGGGT CCGCCCCCCG CGCGGACCAC GACGCCTCCG CCCGCCGGGC CGGCGCGGCA
CCGAACGGCG GCTCGCTCCA CGGCGACTCC CCCCACGACA GCCCGCCCAA CGGCCGCTCA
TCCAACGGCG GTTCGACCAA CGGCGGTTCG GGCCACGGCG GCTCCCCCAA CGGCGGCTCG
GGCGGCAGCG GCTCGCATGG TCGGGACGCC CGCCGCGGCG CGGCGCCCGC CGGGGGTGGG
GCGGACCAGG CCCCCGCCGA CTCGACGCCA CCCCCGCCCG CGACCGGCCC GCTGTACCGG
CCCGGCTCCC CCCACGAGCC CGCGGCCCCC GCCACCACGG GAGGCGGTCC CCCACCGAAT
GGTCGCGGAT GGTGA
 
Protein sequence
MRDFLAAVRG LTVRGRSFLA AGGACVASAA VIGEQDLLAV GALLVALPLF AAGFVARTRY 
RLACTRRLEP PRVTAGDTVS VRIRLDNVSR LPSSVLLVED ATPNLGHRAR FVVDQIEPGG
SRDLSYPLGA GVRGRYQVGP LTIRLTDPFG LCELERSFRG RDELIVAPAL ERLPLTPLVG
SSSLNNEVRR SSARAGEDDS TTRPYRSGDD LRKVHWKTTA RLGELMVRRD ERPLTGAAAV
LLDTRHAAWP EMDRDAPFSW AVGAAGSIAV NLARSGYGVR LIADTGVAAT GPGNAVGALL
DELAVIAPTP SATLSPALAS LRSAEHSGMV VVVLGRTDQA TASMIAGARP RNAPAIAVLV
DLAGWGTSPA AGGDLEVTRH TLTRHGWTVL VAGAGARLAD TWPQIFRPGA SAGRRFAVGN
AVGGMARVSY GSASGPGPAA RAGSAPRADH DASARRAGAA PNGGSLHGDS PHDSPPNGRS
SNGGSTNGGS GHGGSPNGGS GGSGSHGRDA RRGAAPAGGG ADQAPADSTP PPPATGPLYR
PGSPHEPAAP ATTGGGPPPN GRGW