Gene Francci3_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3803 
Symbol 
ID3905551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4560608 
End bp4561780 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content68% 
IMG OID637881129 
Productmolybdopterin biosynthesis-like protein MoeZ 
Protein accessionYP_482882 
Protein GI86742482 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.209325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.526883 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCTTC CGCCCCTGGT CGACCCCGCC GAGGGGCTGA CCGTCGACGA GATCCGCCGG 
TACTCTCGGC ATCTGATCAT TCCGGATGTC GCCATGGACG GCCAGAAGCG GCTGAAGAAC
GCCAGGGTGC TGGCGGTCGG TGCCGGTGGC CTCGGCTCGC CGACGCTGAT GTACCTGGCC
GCCGCCGGCG TCGGGACGCT AGGCATCGTC GAGTTCGACA CCGTTGACGA GTCGAACCTG
CAGCGTCAGA TCATCCACGG CCAGTCCGAC GTCGGCCGCT CGAAGGCGGA GTCGGCCCGT
GACTCGGTTC GCAACATCAA CCCGTACGTG AACGTCGTCC TGCACGAGAC CCGGCTGGAC
GCCTCCAACG TCATGGAGAT CTTCAGCGGG TACGACCTCA TCGTCGACGG CACGGACAAC
TTCGCCACCC GTTACCTGGT CAACGACGCC GCGGTGCTGC TCGGCAAGCC CTACGTCTGG
GGTTCGATCT ACCGCTTCGA CGGTCAGGCC AGCGTCTTCT GGGCCGAGCA CGGACCGTGC
TACCGCTGCC TCTACCCGGA GCCGCCTCCT CCCGGCATGG TCCCCTCCTG CGCCGAGGGC
GGGGTGCTGG GTGTGCTGTG CGCCTCCATC GCCTCCATCC AGACCACCGA GGCCATCAAG
GTGCTGACCG GGGTCGGTGA TCCGCTGGTC GGTCGGCTGA TGGTGTATGA CGCCCTGGAG
ATGACCTATC GGTCGATCAA GGTCCGCAAG GACCCGGAGT GCCCGTTGTG CGGGAAGAAC
CCGACGATCA CCGAGCTGAT CGACTACGAG GCGTTCTGCG GGGCGGTCTC GGAGGAGGCG
CAGTTGGCCG CCGCCGGCTC GACGATCACC GCGGGCGAGC TCAAGAGCTG GCTGGATGCC
GGCGAGCCGA TCGAGCTCGT CGACGTCCGT GAGCCGGCCG AGTGGGAGAT CGTCCGGATC
CCCGGCGCGC GCCTGATCCC CAAGGGGGAC CTGCCCGCGC ATCTCTCCGA ACTGCCGCAG
CACCGTCGGG TGGTCGTCTA CTGCAAGTCC GGGGTGCGCT CGGCCGACGC GCTCGCCACG
CTGAAGGGCG CAGGCTTCTC CTCTGCCGTG CACGTCCAGG GTGGCGTGAC CGCGTGGGCG
ATCCAGGTCG ACAAGTCGCT GCCGGTCTAC TGA
 
Protein sequence
MSLPPLVDPA EGLTVDEIRR YSRHLIIPDV AMDGQKRLKN ARVLAVGAGG LGSPTLMYLA 
AAGVGTLGIV EFDTVDESNL QRQIIHGQSD VGRSKAESAR DSVRNINPYV NVVLHETRLD
ASNVMEIFSG YDLIVDGTDN FATRYLVNDA AVLLGKPYVW GSIYRFDGQA SVFWAEHGPC
YRCLYPEPPP PGMVPSCAEG GVLGVLCASI ASIQTTEAIK VLTGVGDPLV GRLMVYDALE
MTYRSIKVRK DPECPLCGKN PTITELIDYE AFCGAVSEEA QLAAAGSTIT AGELKSWLDA
GEPIELVDVR EPAEWEIVRI PGARLIPKGD LPAHLSELPQ HRRVVVYCKS GVRSADALAT
LKGAGFSSAV HVQGGVTAWA IQVDKSLPVY