Gene Franean1_6197 details

Gene Information

Locus tag: Franean1_6197
Symbol:
ID: 5674518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7529571
End bp: 7530728
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 78%
IMG OID: 641245049
Product: VWA containing CoxE family protein
Protein accession: YP_001510447
Protein GI: 158317939
COG category: [R] General function prediction only
COG ID: [COG3552] Protein containing von Willebrand factor type A (vWA) domain
TIGRFAM ID:
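
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon; the record does not state either convention. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length_bp = gene_length_bp(7529571, 7530728)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields listed in the record.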


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.787747
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0617522
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGCGC GGCCGTCCGC CGCGGGTCCG CTCGCCGAAC CGGACCCGCT GGTCGCCCTG 
ACCGGGTTCG CCCGCGCGCT GCGCGGCGCC GGCGCGGCGG CGGACGCGAG CCGGGTCGCG
ACGGCGGTGC AGGCACTCAC CCACCTCGAT CCCACCAGCG CCGCCGACGT GTACTGGGCC
GGTCGCCTGG CGCTGTGCGC CGAACCGGAC GACCTGCCGC GCTACGACGC GCTGTTCGAC
GAGTGGTTCC GCGGGCGCCT CGACGGGCTG CCCGGGCAGG CCGCCGCCGG GGCCGTCGCG
CCGTCGTCGC GGGCGGTGCG GGTCTGGCCG TCCAGCGGCA CCGGCACCGC GCGCGACGAC
GGTGACGACA GCCCGGCCGA CCTGCTCCCC GTCGGCGCGA GCGACGTCGA GCTGCTGCGT
CGCCGCGACG TCGCGGACCT CAGCCCGGCG GAGCGGGCCG AGATCGACCG GCTCGTCGGC
CTGCTCGCCC CCCGGGTCGG GTCCCGTCCC AGCCGCCGGC GCCGGCCCGG CGGGAACCGC
GGGCTCGACC CGCGCCGCAC CGTGCGCGCC ATGCTGCGCG ACGGCGGCGA GCCCGGTGAG
CTCGTCCGCG CGCGCCCGCG GGTACGGCCC CGGCGGCTGG TGTTCCTCGT CGACGTCAGC
GGTTCCATGA GCCCCTACGC CGACGTGATG CTGCGCTTCG CGCACGCCGC CGTCCGGGTC
GCCCCGTTCG CCACCGAGGT GTTCACCTGC GGGACCAGGC TGACGCGACT CACCCGTCCG
CTGCGGCTGC GGGACGCGGG GGAGGCCCTG AGGGCGGCCG GCGAGGCGAT TCCCGACTGG
AGCGGCGGCA CCCGCCTCGG CGAGTCGCTG CGCGCCTTCC TCGACCTGTG GGGGCAGCGG
GGCACCGCCC GCCAGGCGGT CGTGGTGATC GTGTCCGACG GGTGGGAGCG CGGCGACGTC
ACCCTGCTCG CCGAGCAGAT GGCCCGGCTG GCCCGGCTCG CGCACCGGGT CCTCTGGGTG
AACCCGCACA CCGGCCGGGA CGGGTTCACG CCGACCGCCG CCGGCATGTC CGCGGCGCTT
CCCCACGTGG ACGACCTGTT GGCCGGGCAT ACGTTCCAGG CACTGCGAGG ACTTGCCGAG
GTGATCTCCG ATGCGTGA
 
Protein sequence
MNARPSAAGP LAEPDPLVAL TGFARALRGA GAAADASRVA TAVQALTHLD PTSAADVYWA 
GRLALCAEPD DLPRYDALFD EWFRGRLDGL PGQAAAGAVA PSSRAVRVWP SSGTGTARDD
GDDSPADLLP VGASDVELLR RRDVADLSPA ERAEIDRLVG LLAPRVGSRP SRRRRPGGNR
GLDPRRTVRA MLRDGGEPGE LVRARPRVRP RRLVFLVDVS GSMSPYADVM LRFAHAAVRV
APFATEVFTC GTRLTRLTRP LRLRDAGEAL RAAGEAIPDW SGGTRLGESL RAFLDLWGQR
GTARQAVVVI VSDGWERGDV TLLAEQMARL ARLAHRVLWV NPHTGRDGFT PTAAGMSAAL
PHVDDLLAGH TFQALRGLAE VISDA
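
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares for elongation, and treats ATG, GTG, and TTG as initiator codons read as methionine; that set is only a subset of the initiators table 11 allows. The `translate` function and its `is_cds_start` parameter are hypothetical names, and only the first and last blocks of the gene sequence are used here rather than the full 1158 bp.

```python
import itertools

# Standard codon assignments in NCBI "TCAG" order (shared by table 11 for elongation).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiator codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a DNA fragment in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon at the start of the CDS is read as methionine.
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (start of the CDS).
n_terminal = translate("GTGAACGCGC GGCCGTCCGC CGCGGGTCCG")
# Last 18 bases of the gene sequence (in frame, ending with the stop codon).
c_terminal = translate("GTGATCTCCG ATGCGTGA", is_cds_start=False)
print(n_terminal, c_terminal)
```

The gene begins with GTG rather than ATG, which is why the initiator handling matters: without it the first residue would be read as valine instead of the methionine shown in the protein sequence.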