Gene Francci3_3906 details

Gene Information

Locus tag: Francci3_3906
Symbol:
ID: 3906674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4674267
End bp: 4676261
Gene Length: 1995 bp
Protein Length: 664 aa
Translation table: 11
GC content: 73%
IMG OID: 637881232
Product: von Willebrand factor, type A
Protein accession: YP_482985
Protein GI: 86742585
COG category: [R] General function prediction only
COG ID: [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
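
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the gene information fields above.
start_bp = 4674267
end_bp = 4676261
gene_length_bp = 1995
protein_length_aa = 664

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
expected_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", leftover_bases == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree with one another.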


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0145904
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATTCCG GAACATTCCG GTATGGCCCG TGGGACGGCG GACCCGACCC ACTCGCCTCT 
CCCTTCACCG CGTCCGACGC CCTGGATGAG ATGTCCCGCC AGATTCTGGA GGGACGCACC
CCGCGGGAGG CTCTGGAGTC GCTGCTGCGC CGGGGCATGC CGGGCCGGCG TGGTCTGGAC
GACATGCGCC GGGCCGTCGA GCGGCACCGG CGGGAGGCCC GATCCCACGG CCGGATGGAC
GGGACACTCC AAGAGGTCCG CCGGCTGCTC GACACCGCCG TGGGCCAGGA ACGGGCGGCG
CTGTTCCCCG ACCCCTCCGA CGACGCCCGG CTGCGCGAAG CCGAGCTGGA CGCGCTGCCC
GCGGACCCCG CCCGTGCGGT GCGCTCGCTC GCCGAGTATG ACTGGCGGTC CCCGCCGGCC
CGCGAGACGT ACGAGAAGAT CTCCGAGCTG CTGCGTCGCG AGGTCCTCGA TTCCCAGTTC
AAGGGGATGA AGCAGGCCCT GGAGGGGGCC ACGCCCGAGG ACTTCGCCCG GATCCGGGAG
ATGGTCGGCG CGCTCAACGC CCTGCTGGAG GCGGACGCCC GCGGTGAGGA CACCGACGAG
GCGTTCGCCC GGTTCATGCG GGAGTTCGGC GACTTCTTTC CGGAGAACCC CGGCTCGCTG
GAGGAACTGG TTGACGCGCT GGCCCGCCGG GCCGCCGCCG CCGCGCGGTT GCTGGCCGGG
CTCACCCCGC AGCAGCGCTC CGAACTCGCC GATCTGATGT CCACGGCGAT GGAGGACCTG
GGGCTCGCCG CGGAGATGTC ACGACTGGCG CAGGCGCTGC GCGCGTCCCG ACCTGACCTC
AACTGGGGTG GACGGCTGCG GGGGCGGGCC GGCCGGTTGG GCGACGGCCT GAGCGGCGAG
GAACCGCTCG GTCTCGGCGA CGCCACCACC GCGCTGGAGG AGCTCGCGGA GCTGGACGAA
CTGGCCGCCG CGCTGGGGCA GGACTACGCC GGGGCCTCCC TGGAAGACGT TGACCCCGAG
GCGGTGGCCC GGGCGCTGGG CCGGTCCGCC GTCGACGATC TGCGAGGGCT GCAGGAGATC
GAACGGGAGC TGGAGCGGCA GGGATTCCTC ACCCGCCGGG CCGGGAGCCT CGAGCTCACC
CCGCGGGCGG TGCGCCGGAT CGGGCAGGCC GCACTGGCCC GGATCTTCCG GCAGGTCTCC
GCCCGGGGCC GCGGGGATCA CAGCGTGACC GACGCGGGTT CCGCCGGGGA TCTGCTGGGC
ACCTCCCGGG CCTGGCAGTT CGGCGACACC CAGCCGATCG ACGTCGTCCG CACGGTCCGC
AACGCGGTGC TGCGCGGCGG TCCGCCCGGC CGGGGCCACC CGATCCGCCT GGCGGTGGCC
GACTTCGAGG TCGCCGAGAC GGAGCGCCGC GCCACGGCGG CGGTGTGCCT GCTCGTCGAC
CTGTCCTACT CGATGGCGCT GCGGGGCACC TGGGGCATCG CGAAGTCGAC GGCACTCGCG
CTGCACACGC TCGTCAGCAC GTCGTTCCCG CAAGACAAGA TCCACATTGT CGGCTTCTCG
GACTACGCCC GGGAGCTGCG GCCGGTCGAG CTCGCCGGGC TCGACTCCGA GATGGTGCAG
GGCACGAACC TGCAGCACGC CCTGCTCATC GCTGGCCGGT TGCTGAGCCG CTACCCGCAG
TCGGAGCCGG TCATCATGGT GGTCACCGAC GGTGAGCCCA CCGCGCACCT GCTGCGCGAC
GGTACTCCCT CGTTCTCCTG GCCGCCGATG CCCGAGACCC TGGAGCTGAC CCTGGCCGAG
GTCGACCGGC TCACCCGGCG CGGCGTGACC GTCAACGTGT TCATGCTCGA CGACGAGCCA
CGGCTGGTGC AGTTCGTCGA GGAGATAGCC AGACGCAACG GCGGGCGGGT TCTCTCACCC
GATCCCGCCG CCCTCGGCAA CTACGTCATC CGGGACTACC TGCGGGCCCG GGGCAGGCAC
CGCACCGCCC GCTGA
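
The GC content reported for this gene is presumably the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the 10-base spacing in the listing is only for display. The function name `gc_fraction` is illustrative and not part of any database tool.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Only the first line of the gene sequence is used here, so the result
# is not expected to equal the value reported for the whole gene.
first_line = "GTGAATTCCG GAACATTCCG GTATGGCCCG TGGGACGGCG GACCCGACCC ACTCGCCTCT"
print(f"GC fraction of first 60 bases: {gc_fraction(first_line):.2f}")
```

Passing the full gene sequence to the same function would give the value to compare against the reported percentage.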
 
Protein sequence
MNSGTFRYGP WDGGPDPLAS PFTASDALDE MSRQILEGRT PREALESLLR RGMPGRRGLD 
DMRRAVERHR REARSHGRMD GTLQEVRRLL DTAVGQERAA LFPDPSDDAR LREAELDALP
ADPARAVRSL AEYDWRSPPA RETYEKISEL LRREVLDSQF KGMKQALEGA TPEDFARIRE
MVGALNALLE ADARGEDTDE AFARFMREFG DFFPENPGSL EELVDALARR AAAAARLLAG
LTPQQRSELA DLMSTAMEDL GLAAEMSRLA QALRASRPDL NWGGRLRGRA GRLGDGLSGE
EPLGLGDATT ALEELAELDE LAAALGQDYA GASLEDVDPE AVARALGRSA VDDLRGLQEI
ERELERQGFL TRRAGSLELT PRAVRRIGQA ALARIFRQVS ARGRGDHSVT DAGSAGDLLG
TSRAWQFGDT QPIDVVRTVR NAVLRGGPPG RGHPIRLAVA DFEVAETERR ATAAVCLLVD
LSYSMALRGT WGIAKSTALA LHTLVSTSFP QDKIHIVGFS DYARELRPVE LAGLDSEMVQ
GTNLQHALLI AGRLLSRYPQ SEPVIMVVTD GEPTAHLLRD GTPSFSWPPM PETLELTLAE
VDRLTRRGVT VNVFMLDDEP RLVQFVEEIA RRNGGRVLSP DPAALGNYVI RDYLRARGRH
RTAR
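
The protein sequence can be related back to the gene sequence by codon translation. The sketch below assumes that translation table 11 assigns the same amino acids as the standard genetic code and differs only in which codons may act as start codons, so the leading GTG is read as methionine (M). The helper name `translate_cds` is illustrative. It treats the first codon as an initiator without validating it.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    usable_length = len(bases) - len(bases) % 3  # drop any incomplete codon
    protein = []
    for position in range(0, usable_length, 3):
        amino_acid = CODON_TABLE[bases[position:position + 3]]
        if amino_acid == "*":
            break
        # Assumption: the first codon is an initiator and is read as M.
        protein.append("M" if position == 0 else amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "GTGAATTCCG GAACATTCCG GTATGGCCCG TGGGACGGCG GACCCGACCC ACTCGCCTCT"
print(translate_cds(first_line))  # MNSGTFRYGPWDGGPDPLAS
```

The output corresponds to the first 20 residues of the protein sequence listed above.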