Gene Francci3_4039 details

Gene Information

Locus tag: Francci3_4039
Symbol:
ID: 3907000
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4823547
End bp: 4825004
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 74%
IMG OID: 637881368
Product: molybdopterin molybdochelatase
Protein accession: YP_483118
Protein GI: 86742718
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
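
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4823547, 4825004)
print(length)                  # 1458
print(protein_length(length))  # 485
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.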


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGACC CGGCGTGCGA CGAGTGGTGC ATCGAAGGGC AGAGAATTGG ACGGGCGCCC 
ATGACTTCGG TGGATGAACA CCTCGAGCGG ATCCTGGCTA CGGTGCGGCC GACCCGCGCT
CGGCAGGCCA GGCTGGACGA GGCGCACGGA TGCGTGCTCG CCGCGGACGT CCGGGCGACG
GTCGCGCTTC CCGGGTTCGA CAACTCGGCC ATGGACGGGT ACGCCGTCCG GGCCGCCGAC
GTGGCCGAGG CGTCGGCGGC CACTCCCGTC TCCCTACCGG TCATCGGCGA CGTCCCCGCG
GGTCCGGGTT CGCCGGCCGC GCTGCTCCCC GGCGCCGCGG TACGCATCAT GACCGGGGCC
CCGCTGCCCC CGGGAGCGGA TACGGTCGTC CAGCTCGAGT GGACCGACGG TGGCCGCCGG
ACCGTTGCGG TCCAGCGGGC GCCGCGCCGT GGCCTGCACA TCCGCCGGAC GGGGGAGGAC
GTCGCCCCGG GCGCCCTGGT GCTCCCCGCC GGGACGATCA TCGGCTCCGC GCAGATCAGC
CTGCTCGCCG CGGTCAACGT GGCCTGCCCG CCGGTGCATC CCCGGCCGCG GGTGGCGGTG
CTGTCCACCG GGACCGAGCT TGTCGAGGTC GGTACACCGC TGGGCGCCGG GCAGATCGTC
GACTCGAACA GCCATGGCAT CGCCGCCGCG GCACGGGAGG CCGGCGCGGT GGTACGCCGC
CTGGCCGGGG TGCCCGACGA GCCGGACGCC TTCGCAGCAG CCCTGCACGG GGTGCTTCCC
GACGTGGATG CTGTGGTGAC GACCGGTGGC GTCAGCGTCG GGGCATATGA CGTCGTCAAG
GAGGTGCTCG CGCGGACCGG GACTGTCCGG TTCGATCGGG TCGCGATGCA GCCGGGCAAG
CCGCAGGGTT TCGGCCTGGT CGACGGTGTG CCCGTCTTCA CGTTGCCGGG GAACCCGGTC
AGCGCCCTCA TCTCGTTCGA GCTGTTCGTC CGCCCGGCCC TGCAACGTAT GCGCGGTCTC
GCCGGCACCG GGCTCCCGCG GATTGTCGTG ACCGTCGGAT CCGACCTGCG TTCCCCCCTC
GGGAAGAGGT CCTTCCCGAG GGTGGGGCTG GAGCGGAGCG GGGACGGTAG GCGCCTGGAA
CCCGCCACCG TCGCCCATCT GGCCGGCGGG CAGGGGTCAC ATCAGCTCAC CTCGCTGGCG
GGAGCGCAGG CGCTGCTGAT CATCCCCGAG GGGGTTACCG AGGTGCCGGC GGGCAGCCGG
CTGCCGGCGT TGTTGCTCCC GGACGCATCG GGGGCCATTC TCTCCGGCAT GGCCTGGGTG
GTTGCGGACA CTGCGGACGT CGATGCCCCG GATCCGGCTT CCCCGGATCC GGCTTCCCCG
GCCGGCTTCG GCCCGCGGGC CGAAGCCGGC CGGGGGAACG CGGCCGTGCC CCGGGCCGCC
GGATCGGCGG CCCGGTGA
 
Protein sequence
MPDPACDEWC IEGQRIGRAP MTSVDEHLER ILATVRPTRA RQARLDEAHG CVLAADVRAT 
VALPGFDNSA MDGYAVRAAD VAEASAATPV SLPVIGDVPA GPGSPAALLP GAAVRIMTGA
PLPPGADTVV QLEWTDGGRR TVAVQRAPRR GLHIRRTGED VAPGALVLPA GTIIGSAQIS
LLAAVNVACP PVHPRPRVAV LSTGTELVEV GTPLGAGQIV DSNSHGIAAA AREAGAVVRR
LAGVPDEPDA FAAALHGVLP DVDAVVTTGG VSVGAYDVVK EVLARTGTVR FDRVAMQPGK
PQGFGLVDGV PVFTLPGNPV SALISFELFV RPALQRMRGL AGTGLPRIVV TVGSDLRSPL
GKRSFPRVGL ERSGDGRRLE PATVAHLAGG QGSHQLTSLA GAQALLIIPE GVTEVPAGSR
LPALLLPDAS GAILSGMAWV VADTADVDAP DPASPDPASP AGFGPRAEAG RGNAAVPRAA
GSAAR
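
The gene and protein sequences above can be compared by translating the DNA, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It builds the codon table from the standard NCBI amino acid string, which translation table 11 shares with table 1 for internal codons. Table 11 also allows alternative start codons, which this sketch ignores because the gene here begins with ATG. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
from itertools import product

# Codons in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCCGGACC CGGCGTGCGA CGAGTGGTGC"
print(translate(first_blocks))  # MPDPACDEWC
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the GC content field.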