Gene Francci3_3412 details


Gene Information

Locus tag: Francci3_3412
Symbol:
ID: 3905652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4053575
End bp: 4054945
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 74%
IMG OID: 637880735
Product: von Willebrand factor, type A
Protein accession: YP_482495
Protein GI: 86742095
COG category:
COG ID:
TIGRFAM ID:
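
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4053575
end_bp = 4054945

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1371
print(protein_length)  # 456
```

Under those assumptions, both results match the Gene Length (1371 bp) and Protein Length (456 aa) fields.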


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0316076
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACA GGGACCGGCT CGGCGTCGAG GGCCGGATGG ACCGGGGTGT CACGAAGCCG 
GGTGGGGTGC TCCGGCTGAC GATCAGCGTG GCCGATCCGC CGCGGGGCCG GCGGGAGGTG
GCGGTCGCCG TCGCGGTCGA CGCATCCCTG TCGATGCTGC ATCCGGCGGC GCCGGACGCG
GTCAGCAAGT GGGAGCTCGC GCAGCGGGTT CTGGAACGCG TGCACGGCCT GCTGCCCGAC
GGCTGCCCGC TCGTGCTGGT GCGCTTCGCC CGCGAGGCGC GAGTGATCAG CGCCGGACGT
CGGCCCGACC GCCTGCCGGC CGTTGACGGC CCCACGAACA GCGACGCGGA CTCCTACACC
AACATCGGCG ACGCGCTCGC CGTCGCCGGG CGGGAGCTGC TCGCGCAGGC GCCGGACGCG
GACGTCTACC GGGTCCTGCT CATCACCGAC GGTGAGGCCA ACATCGGCCG CTGGCAGCCC
GAGCAGCTCG CCCACATCGT CGCCGGGCTC GGCGACCAGG GCATCGGGAC GGACGCGCTG
GGCATCGGCG TCGACGCCCG GGACGAGGTG CTCGCCGAGA TGGTCGGGGT GTGCGGCCAG
ACGCAGCACG TCTGGGCGGC GGCCGAGGAC GTCGACCGGA TCGACGCGAT CGTCTCGTCG
CTGATCGAAC CGGTGGTCGG CGCGGCCGCG GGGCGCGGTG AGCTGCGGGT CCTGATCCGA
CCCGGCTGGA CGGTGGAGGC GCTGCGGCGG ATCAAACCGC AGCGTCACCT CATGAAGGTG
CCGCCACCGA CCTCGCGCGG CACCGAGCTG CGGCTGCCGC TACCCGCCGT GAGCACCGGC
GAGGACCGTC CGGTCTTCCT GCTCCGGTTG CGCGCTCCGG ACACACCGCT CGGCGCGCAC
ACGATTCTTC AGGCCCGCGG CGGGGTGGTC GCCGGTGGAC GGCGGCTGGC TGTGGACAGC
CGGGCCGACC CGCACGCCGC GCAGCGCTTC CGGGCCACCG TCGAGGCGGA TGTCTTCCCG
GACGCGGAAT CCTCCCTCGA ACACGAGGAG AGCATCGCGG AGTTCGACGA GGAGATTGCG
AAGCGGGCCA GCTTCGCCCA TTCCCGGGAG AGCGTCACCG AGCTGTTCCG GAACGCGGCG
CTGTGGGCGG CGGACCGCGG ATTCGACGAT CTGGCCGACC ACTACAACGC GACGCTACGG
GCCCTCGGCC AGGGTGTCGC GCCGGCCGAC GCGGTGTCGG GAGCCCGGGT CCGGGCGACC
CGCACCAGGA CCCGGACCCG TGATCTCGTC GAGGCGGACC CCGTGTCCGG CCGGCAACAC
GGCGGGGCGG CGTCCCGTCG GCGTCTCGAC GACGTCCTCG GCAACAGCTG A
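
The GC content field of a record like this one could in principle be recomputed from the gene sequence. The sketch below is illustrative and not part of the original record: `gc_content` is a hypothetical helper, and it assumes the sequence contains only A, C, G, and T plus the whitespace used to group bases into blocks of ten. The usage line applies it to the first ten-base block only, not to the full gene.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks that separate the ten-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First ten-base block of the gene sequence above.
first_block = "ATGAACGACA"
print(gc_content(first_block))  # 0.4
```

Passing the full gene sequence as one string, whitespace included, would give the whole-gene fraction for comparison with the reported GC content.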
 
Protein sequence
MNDRDRLGVE GRMDRGVTKP GGVLRLTISV ADPPRGRREV AVAVAVDASL SMLHPAAPDA 
VSKWELAQRV LERVHGLLPD GCPLVLVRFA REARVISAGR RPDRLPAVDG PTNSDADSYT
NIGDALAVAG RELLAQAPDA DVYRVLLITD GEANIGRWQP EQLAHIVAGL GDQGIGTDAL
GIGVDARDEV LAEMVGVCGQ TQHVWAAAED VDRIDAIVSS LIEPVVGAAA GRGELRVLIR
PGWTVEALRR IKPQRHLMKV PPPTSRGTEL RLPLPAVSTG EDRPVFLLRL RAPDTPLGAH
TILQARGGVV AGGRRLAVDS RADPHAAQRF RATVEADVFP DAESSLEHEE SIAEFDEEIA
KRASFAHSRE SVTELFRNAA LWAADRGFDD LADHYNATLR ALGQGVAPAD AVSGARVRAT
RTRTRTRDLV EADPVSGRQH GGAASRRRLD DVLGNS