Gene Franean1_0822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0822 
Symbol 
ID5669238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp960226 
End bp962223 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content75% 
IMG OID641239751 
Productvon Willebrand factor type A 
Protein accessionYP_001505186 
Protein GI158312678 
COG category[R] General function prediction only 
COG ID[COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.649548 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTTCCG GACAGTTCCG CTACGGCCCA TGGGACGGCG GGCCGGACCC GCTGGCCTCC 
CCGTTCTCCG CGACCGACGC CATGGACGAG ATGTCCCGGC AGATCCTCGA GGGGCGCACG
CCCCGGGAGG CCCTGGAAAG CCTGCTGCGC CGGGGCATGC CGAACCGCCG CGGCCTGGAC
GACCTGCGCC GCGCGATCGA GCGCCGGCGC CGCGAGGCCC GCTCGCGCGG CCGGCTGGAC
GGCACCCTCG AGGAGGTCCG CCGGCTGCTC GACACCGCGG TCGGGCAGGA GCGGGCCGCG
CTGTTCCCCG ACCCGGCGGA CGAGGCGCGG ATGCGCGAGG CGGAGCTGGA CGCACTGCCG
TCCGACCCGG CCCGGGCCGT GCGCGAGCTG GCCGACTACG ACTGGCGCTC GCCGCAGGCC
CGCGAGACCT ACGAGCAGAT CTCCGACCTG CTGCGCCGCG AGGTCCTCGA CGCGCAGTTC
CGCGGCATGA AGCAGGCGCT GCAGGGCGCC TCGCCCGAGG ACATGGCCCG CGTGCGCGAG
ATGGTCCGCG AGCTCAACGA GCTGCTGGAG GCGGACGCCC GGGGTGAGGA CACCCAGGGC
CGCTTCGACG AGTTCATGCG TCGCAACGGC GAGTTCTTCC CGGAGAACCC GCGCAACCTC
GACGAGCTCG TCGACGTGCT CGCCCGCCGG GCCGCGGCGG CGGCGCGCCT GCTCGCCGGA
CTGACCCCCC AGCAGCGCCA GGAGCTCGCC GACCTGATGT CCACCGCGAT GGAGGACATG
GGCCTGGCCA CCGAGATGTC CCGGCTGTCG CAGGCGCTGC GGTCGGCTCG GCCGGGCCTG
GACTGGGGCG GACGCTCGCG CGGGCGCGGC CGGGGTCGGT TCTCCGACGC GATGACCGGC
GAGGAGCCGC TCGGCCTCGG TGACGCGACC AGCGCGCTCG AGGAGCTGGC GGAGCTCGAC
GACCTCGCGG CCTCACTCAG CCAGGACTAC CCGGGCGCGT CCCTGGACGA CGTCGACCCG
GAGGCCGTCG AGCGCGCGCT CGGCCGTTCG GCCGTCGACG ACCTGCGTAA CCTGCAGCAG
ATCGAGCGGG AGCTCGAGCG GCAGGGGTTC GTCACCCGCC GCGCCGGAGC GCTGGAGCTC
ACGCCGCGGG CCGTCCGGCG CATCGGGGAG TCGGCGCTCG CGCGGATCTT CCGCGAGGTG
GCGGCGCGCG GGCGCGGCGA CCACAGCCTC ACCGACGCCG GCTCGGCCGG TGACCTGCTC
GGGACGTCCC GGCGCTGGCA GTTCGGTGAC ACCCAGCCCA TCGACGTCGT CCGCACGGTG
CGCAACGCGG TGCTGCGCGG CGGGCCGCCG GGGCGGGGCC AGCGGATCCG GCTGGCGGTC
GACGACTTCG AGGTCGCCGA GACGGAGCGG CGCACCAGCG CGGCCGTCTG CCTGCTCGTG
GACCTCTCCT ACTCGATGGC GCTGCGCGGC ACCTGGGGGG TGGCGAAGTC GACCGCGCTC
GCACTGCACA CCCTGGTGGC CACCCGGTTC CCGCAGGACA AGGTGCACAT CGTCGGCTTC
TCCGACTACG CCCGCGAGCT GCGCCCGGTC GAGCTCGCCG GGCTCGACTC GGAGATGGTC
CAGGGCACGA ACCTGCAGCA CGCGCTGCTC ATCGCCGGAC GGCTGCTGCG CCGCTACCCG
GACTCCGAGC CGGTGATCAT GGTGGTGACG GACGGCGAGC CCACAGCCCA CCTGCAGCGC
AACGGCACGC CGTCCTTCTC CTGGCCGCCG CTGCCCGAGA CACTGGAGCT GACGCTCGCC
GAGGTCGACC GGCTGACCCG CCGCGGCGTC ACGATCAACG TCTTCATGCT GGACGACGAG
CCCCGCCTGG TGCGCTTCGT CGAGGAGATG GCCCGCCGCA ACGGCGGGCG GGTGCTCTCC
CCGGACCCGT CCGCGCTGGG CAGCTACGTC ATCCGGGACT ACCTGCGTTC CCGCAGCTCG
CGGCGCGTCG CCCGCTGA
 
Protein sequence
MSSGQFRYGP WDGGPDPLAS PFSATDAMDE MSRQILEGRT PREALESLLR RGMPNRRGLD 
DLRRAIERRR REARSRGRLD GTLEEVRRLL DTAVGQERAA LFPDPADEAR MREAELDALP
SDPARAVREL ADYDWRSPQA RETYEQISDL LRREVLDAQF RGMKQALQGA SPEDMARVRE
MVRELNELLE ADARGEDTQG RFDEFMRRNG EFFPENPRNL DELVDVLARR AAAAARLLAG
LTPQQRQELA DLMSTAMEDM GLATEMSRLS QALRSARPGL DWGGRSRGRG RGRFSDAMTG
EEPLGLGDAT SALEELAELD DLAASLSQDY PGASLDDVDP EAVERALGRS AVDDLRNLQQ
IERELERQGF VTRRAGALEL TPRAVRRIGE SALARIFREV AARGRGDHSL TDAGSAGDLL
GTSRRWQFGD TQPIDVVRTV RNAVLRGGPP GRGQRIRLAV DDFEVAETER RTSAAVCLLV
DLSYSMALRG TWGVAKSTAL ALHTLVATRF PQDKVHIVGF SDYARELRPV ELAGLDSEMV
QGTNLQHALL IAGRLLRRYP DSEPVIMVVT DGEPTAHLQR NGTPSFSWPP LPETLELTLA
EVDRLTRRGV TINVFMLDDE PRLVRFVEEM ARRNGGRVLS PDPSALGSYV IRDYLRSRSS
RRVAR