Gene Franean1_5847 details


Gene Information

Locus tag: Franean1_5847
Symbol:
ID: 5674170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7093113
End bp: 7094411
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 76%
IMG OID: 641244697
Product: hypothetical protein
Protein accession: YP_001510099
Protein GI: 158317591
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
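
The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based, inclusive positions, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 7093113
end_bp = 7094411
gene_length_bp = 1299
protein_length_aa = 432

# Assumption: Start/End are 1-based, inclusive coordinates on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches protein length:",
      expected_protein_aa == protein_length_aa)
```

Under those assumptions the span is 1299 bp, which is 433 codons, and dropping the stop codon gives the 432 aa listed above.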


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.426804
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGATCA CCGGCCGCGC CGTGGCGCTG GTCGCGCTCG GCATCGTCGT GGTGGCTCTC 
TCCCCGGTGC CAGGCGCGAC CTTCGTGCTG GTGAACCTGC TGGTCGTCGC GCTGATCTGC
GTCGACGTGT GGCTGGCGGG GCAGGTGCGC GCGTGCACGT TCGCCCGCTC CGGGCCGACC
GCGGCCCGGC TGGGTCAGCC GGTTCCGCTC GTGCTGACCG TCACCAACAA CGGAACCCGC
ACCGTGCGGG CCCGCATCCG CGACGGCTGG CCGCCGTCCG CCGGCGCCCG ACCGCGGGTG
CACCGGGTGA CGGTGCCGGC GGGCGAGCGC CGGTTCCTGG AGACCACGCT GAACCCGACC
CGGCGCGGTG ACCGGATGCC CGTCCGCATC ACCGTGCGCT CGCTGGGCCC GCTGGGCCTG
GCCGGGCGGC AGGGCCGGCA CCACTGCCCG TGGCGGGTGC GGGTGCTGCC GCCGTTCTCG
TCCCGCCAGC ACCTGCCGGC CGCGCTGGCC CGGTTGCGCG AGGTGGAGGG CGAGGTCGCG
ATCCGCGGCG GCGGCGCCGG CTCGGAGTTC GACAGCCTGC GGGAGTACGT CGTCGGCGAC
GACGTGCGGA CGATCGACTG GCGGGCCACC GCCCGCCACC ACGGCTCGGT GGTCGTCCGC
ACCTTCCGGC CCGAACGGGA CCGCCGGGTC ATCTGCGTCC TCGACACCGG CCGGACGTCA
GCCGGCCGGG TCGGCGACGT CCCCCGGCTG GACCACGCGT TGGACGCCGC ACTGCTGCTC
ACCGCCGTCG CCCTGCGGGC CGGGGACCGG GTCGGGCTGG TCGCCCACGA CAGCACCAGC
CGGATCTCCC TGCCGACCTC ACGGGACAAC GGCCTGCTGG CCCGGATGAG CGATGCGATG
GCCACCCTGG AACCGGCCCT CGTCGAGGCC GACCACGCCG GGATGGCCTC GGCGGTGCTG
CGCAACGCCT CCCGGCGTGC CCTCGTTGTC ATCTTCACCG AGCTTGTTCC GGCGGTGATC
GAGGACGGCC TGCTGCCGGC GCTGCCCACG CTGACCTCCC GGCACACGGT TCTCGTGGCC
GCGCTGCGCG ACCCCCGGCT TGACGAGCTC ACAGCCGGGC ACGGCGACGT CCACCAGGTG
TACGCGGCGG CGGCAGCCGA GCAGACGCTG CTGCGGCGGC GCGAGCTGAC CGAGGCGCTG
CGCCGCCGCG GCGTCGAGGT GGTGGACGTG GCACCGGCGC ACTACGCGGC GCTGGTCACC
GACACCTACC TCACGCTGAA GGCCCGCGGA CGCCTGTGA
 
Protein sequence
MAITGRAVAL VALGIVVVAL SPVPGATFVL VNLLVVALIC VDVWLAGQVR ACTFARSGPT 
AARLGQPVPL VLTVTNNGTR TVRARIRDGW PPSAGARPRV HRVTVPAGER RFLETTLNPT
RRGDRMPVRI TVRSLGPLGL AGRQGRHHCP WRVRVLPPFS SRQHLPAALA RLREVEGEVA
IRGGGAGSEF DSLREYVVGD DVRTIDWRAT ARHHGSVVVR TFRPERDRRV ICVLDTGRTS
AGRVGDVPRL DHALDAALLL TAVALRAGDR VGLVAHDSTS RISLPTSRDN GLLARMSDAM
ATLEPALVEA DHAGMASAVL RNASRRALVV IFTELVPAVI EDGLLPALPT LTSRHTVLVA
ALRDPRLDEL TAGHGDVHQV YAAAAAEQTL LRRRELTEAL RRRGVEVVDV APAHYAALVT
DTYLTLKARG RL
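
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of how that could be done; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample uses only the first and last lines of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, used here as a short sample.
sample = """
GTGGCGATCA CCGGCCGCGC CGTGGCGCTG GTCGCGCTCG GCATCGTCGT GGTGGCTCTC
GACACCTACC TCACGCTGAA GGCCCGCGGA CGCCTGTGA
"""

dna = clean_sequence(sample)
print("bases in sample:", len(dna))
print(f"GC fraction of sample: {gc_fraction(dna):.0%}")
```

Pasting the full gene sequence into `sample` should let the resulting length and GC fraction be compared with the Gene Length and GC content fields of the record.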