Gene Franean1_4946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4946 
Symbol 
ID5673285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5938085 
End bp5939380 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content74% 
IMG OID641243800 
Productvon Willebrand factor type A 
Protein accessionYP_001509216 
Protein GI158316708 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0264631 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.726246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGACT TCAGTCTCGA GGTCAGCCAG AACAAGTACC TGCCCGAGGG GTCGGGCGAG 
GTGCACGCCG TCATCACCGT CACCGCGCAC GACGTCCGCC CTGGTACTCC CGGCGCCCCC
GGGACGGCCG GGACCGCTAC CACGGCGGGC GCCGCCGAGG TCATCCTGCT GGACTGCTCG
GGGTCGATGG ACTACCCCCA CTCGAAAATC ATCGAAGCCC GCCGGGCGGC CCAGGCCGCC
ATCGACACGC TGCACGACGG TGTCGCGTTC GCCGTGGTGG CAGGCACCGG GCAGGCCGAA
ATGGTGTACC CGACGCGCCA GGAGCTCGTC GAGGCGTCGC CGCGGACCCG CGAGGCCGCG
AAGGCCGCGG TGAAACGGCT GCAGCCCCAC GGCGGCACCG CCATGGGCCG GTGGCTGCTG
CTCGCCCGCG ACCTGATGGC CACCCGCCCG GACGCCATCC ACCACGCGAT CCTGCTGACC
GACGGCCAGA ACGGCGAGAG CGAGGCCGTC TTCGCCGCCG CACTGGCCGC GTGTGAGGGT
CGGTTCCAGT GCGACTGCCG CGGCGTCGGT GCCGACTGGA AGGTGGCGGA GCTGCGCCGC
GTCGCCTCCA CCCTGCTGGG CGGCGTCGCC CTGCTGCGCG AGCCGGCGGA GATGGCGGAG
GACTTCCGCT CGCTGATCGA GCGGGCACAG GCCCGCGGGA TCGACCGGGT CGGCCTGCGG
GTGTGGACGC CCAAGGGGGC GACGATCCGG TTCCTGCGCC AGGTGTCGCC CGAGCTCGAG
GACCTCACCG CGCGGGCCGT CGAGGTCAAT CCGCTCACCC GCGACCATCC GACGGGCGCC
TGGGCCACCG GGACACGGGA GTACCACCTG TGCGTCGACG TCCCCCCGGC GCCGGTGGGG
AACGAACGGC TCGCCGCCCG CGTCAGCGTG ATCGCCGGCG GGGACGAGCT CTCCCGGACG
GCGGTGCTCG CCGCGTGGTC CGAGGATGAC GAGCTGTCGA CCCGCATCGA CGAGGTCGTC
GCGCACTACA CCGGCCAGAC GGAGCTGGCG CGCGCCGTGC AGGACGGGCT CGCGGCACGC
CGCGACGGGG ACGAGGTCAG CGCGGTCACC CTGCTCGGCA GGGCCGCGCG GATCGCGGCC
GCCGCCGACG ACGGCGCGAC CCTCGAACGG CTGGCGAAGG TCGTCGACAT CGACGATGCG
GCCACCGGAG CGGTGCGGCT GCGTCCCCAG GTCGACACCC TCGACGAGAT GGACCTGGAC
GCGGGCTCGA CCGTCACCGT GCCCGCCCGG CGGTGA
 
Protein sequence
MVDFSLEVSQ NKYLPEGSGE VHAVITVTAH DVRPGTPGAP GTAGTATTAG AAEVILLDCS 
GSMDYPHSKI IEARRAAQAA IDTLHDGVAF AVVAGTGQAE MVYPTRQELV EASPRTREAA
KAAVKRLQPH GGTAMGRWLL LARDLMATRP DAIHHAILLT DGQNGESEAV FAAALAACEG
RFQCDCRGVG ADWKVAELRR VASTLLGGVA LLREPAEMAE DFRSLIERAQ ARGIDRVGLR
VWTPKGATIR FLRQVSPELE DLTARAVEVN PLTRDHPTGA WATGTREYHL CVDVPPAPVG
NERLAARVSV IAGGDELSRT AVLAAWSEDD ELSTRIDEVV AHYTGQTELA RAVQDGLAAR
RDGDEVSAVT LLGRAARIAA AADDGATLER LAKVVDIDDA ATGAVRLRPQ VDTLDEMDLD
AGSTVTVPAR R