Gene Francci3_4334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4334 
Symbol 
ID3907304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5175226 
End bp5176680 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content71% 
IMG OID637881663 
Productputative cytochrome P450 
Protein accessionYP_483409 
Protein GI86743009 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.686964 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAACAC CCATTCCCGA AGCAGGCGCG GAATCAGGCG CTGCGATCCC GCCTCCGGGG 
TGTCCGGCGC ACGCGGCCTT CGCGCAGGAT GCCGTACGGA CCGCGCTCTA CGGCCCGGAA
GCGCAGCACG ATCCCGGCGC GGTCTACGAG AAGCTCCGCG CGGAACACGG CGGGATCGCC
CCGGTGGAAC TCGAGGGTGG CGTCCCGGCC TGGCTGGTGC TCGGCTACCG GGAGAACATG
GAGGTCGCTC GCACCCCGAG CCGGTTCACC CGGGACGCCC GGCTCTGGCG GGACTGGAAC
GAGGGCAGGA TCGCCGCGGA CGACCCGCTG CTCCCGCTCA TCGGATGGCG TCCCGACGTC
GTGTCCTACG ATGGCGAGGA ACATGCGCGG CTGCGCGCCG CGGTCAACGA GTGCCTGACC
AGGTTCGACC GGCACGGCAC CCGGCGGCAC GTCCAGCGCT ACGCGAACCA GCTGATCGAC
GGCTTTGTCG AGGACGGCCG CGCGGACCTG GTGACCCAGT TCGCCGTTTA CCTTCCGATG
CTCGTGCTGA GCCGGCTGGT CGGCCTGAGC GAGGGGTACG GCCGGAAACT GGTCGAGGCC
ATCGTCGGCA TGGTCAGCGG CGGCGAGGAT GCATACGCGC ACAACCAGTA CATCATCGGC
ACCCTGCGGT CGCTGACCGA GGAGCGGCGC AGGGCCCCTG CGCACGACCT GGCGTCCTGG
TTCGTTCAGC ACCCCTCGGG GCTGAACGAC GAGGAGGTAC TCAACCACCT GCGCCTCGTG
ATCGTGCTGG GCTACGAGGC CACGGCGAAC CTGGTCTCGA ACACGCTGCG GATGGTGCTG
ACCGACCCCC GCTTCCGCGC GTCGCTCGCT GGCGGGCTCA TGACGCTGCC GGACGCGGTC
GAGCAGATGC TGTGGGACGA TCCGCCGCTA CTGGTCTGCC CGGCGCGGTT CGCCACGCAC
GACATGCATT TCGCCGACAA GGAGATCCGC GAGGGCGACA TGCTGCTGCT CGGCATCGCG
GCAGGCAACG CGGATCCGGA GATCCGTCCG GATCTCGGCG CGCCGATGCA CGGCAACCGC
TCGCACCTGG CGTTCAGCCG GGGCCCGCAC GAGTGCTCGG GACAGGAGAT CGCCCGCGCC
ATCACCGACA CCGGCGTCGA CGTGCTGCTC AACCGGCTAC CCGACCTGCA CCTCACGGTG
CCGGAGGAGC AGCTGACCTG GACCGCCTCG ACCTGGTCCC GGCACCTTGA CGCGCTCCCG
GTCAAGTTCG CCCCGCAGTG CCGCCCCGTC CCGCAGTCTG CGCCAGCCGC CCCGGCGTTC
TCCGCCCCGG TGGTCGAGCC GAGCGCCGCG AAGGACGCGC TGGCGGCGGT GGCCGAGGCC
GACGCACAGG CCAGAGCGGA CACCCGCCGC GGCTTCTGGG CTTCGCTGGG CGGCATGTTC
CGCGGTCGAC GCTAA
 
Protein sequence
MSTPIPEAGA ESGAAIPPPG CPAHAAFAQD AVRTALYGPE AQHDPGAVYE KLRAEHGGIA 
PVELEGGVPA WLVLGYRENM EVARTPSRFT RDARLWRDWN EGRIAADDPL LPLIGWRPDV
VSYDGEEHAR LRAAVNECLT RFDRHGTRRH VQRYANQLID GFVEDGRADL VTQFAVYLPM
LVLSRLVGLS EGYGRKLVEA IVGMVSGGED AYAHNQYIIG TLRSLTEERR RAPAHDLASW
FVQHPSGLND EEVLNHLRLV IVLGYEATAN LVSNTLRMVL TDPRFRASLA GGLMTLPDAV
EQMLWDDPPL LVCPARFATH DMHFADKEIR EGDMLLLGIA AGNADPEIRP DLGAPMHGNR
SHLAFSRGPH ECSGQEIARA ITDTGVDVLL NRLPDLHLTV PEEQLTWTAS TWSRHLDALP
VKFAPQCRPV PQSAPAAPAF SAPVVEPSAA KDALAAVAEA DAQARADTRR GFWASLGGMF
RGRR