Gene Francci3_1965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1965 
Symbol 
ID3903673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2306248 
End bp2307450 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content68% 
IMG OID637879302 
Productcytochrome P450 
Protein accessionYP_481069 
Protein GI86740669 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGATC CCGTGGTGAC CGACGTCGGG ACGTCGGCAC GCAAGATCGA CCGCGAAGAC 
CCGAACGGCG GATGCCCGGT CGTACAGGGC GCCGACGGCG TATGGCGGAT CAGCGGGTAC
GCCGCCGGGC AGGCGGTGCT GCGCAGCTTG GAAACCAAGC AGGCCGGCCT CGGTATCGAC
GCATCTAAGG CGATTCCCAA GCGGATCCGC CGACCGGTGC TGCACAGCGA CGGCCCCGAG
CACCGGGAAC GCCGCCGGCT GACCGCACGA TTCTTCACTC TACGCAGGGT CGACGAGCAC
TACCGCGAGC TGATGCACCG CGTCGCCGAC GAGCAGATCG ACCGGTTGCG CCGCGAGCGG
AGCGTCGACC TGTCCGAGCT CAGCTTCGCC CTCGCCGTCG AGGTCGCCGC CGCGGTGATC
GGCCTCACCA ACAGCCGCCC CGGCATGGCC GCCCGCCTGG AACGGTTCGC CCAGGGAGAC
CTCGGACCGC CGAATGTCAC CAGCATCCGC GGCATCAGGC AGTTCATCCG GCAGAACCGC
CACGCCCTCG CCTTCTACCT CGCCGACGTG CGCCCCGCGG TACGCGCCCG CCGTCGGCGA
CGCACGGATG ACCTCATCTC ACACATGATC GACCAAGGCT GCACCAATGC CGAAATCTTC
GCCGAGTGCG TCACCTTCGC ACCCGCAGGA ATGATCACCA CCCGGGAGTT CATCAACGTC
GCCGCCTGGC ACCTGTTCAC CGACGACACA CTGCGCGCCC GCTACCACGA CGCTGACCAG
ACCGAACGGA TCGCGATCCT GCACGAGTTC CTGCGCTTGG AACCGGTCAT CTCCACTCTC
AAACGGCGCA CGACGGCCGA CATCCAGCTA CCGGGCCCGC ACGGCCCGCT AACGATCCCA
GCCGGCGCCC AGATCGACAT CGCGGTGAGC AGCACCAACA TCGACACGCA GGCCATCGGC
GCCGACCCGT ACACAGTCCG CCCCGCCCGC CCGATCGGCG ACGGCGTGAG TCCGGCGGGG
CTGAGCTTCG GCGACGGCCC CCACAAATGC CCCGGCGCAC ACGTCGCCAT CCACGAAACC
GACATCTTCC TGCACAAGCT GTTCATGCTC GACGGCCTGC ACATGGCCAG CCCACCCCAG
GTCACCCTCC GGGACGAGAT CGCGGCCTAC GAGCTGCGCG GCCTCGTCGT CACGCTCGAC
TGA
 
Protein sequence
MADPVVTDVG TSARKIDRED PNGGCPVVQG ADGVWRISGY AAGQAVLRSL ETKQAGLGID 
ASKAIPKRIR RPVLHSDGPE HRERRRLTAR FFTLRRVDEH YRELMHRVAD EQIDRLRRER
SVDLSELSFA LAVEVAAAVI GLTNSRPGMA ARLERFAQGD LGPPNVTSIR GIRQFIRQNR
HALAFYLADV RPAVRARRRR RTDDLISHMI DQGCTNAEIF AECVTFAPAG MITTREFINV
AAWHLFTDDT LRARYHDADQ TERIAILHEF LRLEPVISTL KRRTTADIQL PGPHGPLTIP
AGAQIDIAVS STNIDTQAIG ADPYTVRPAR PIGDGVSPAG LSFGDGPHKC PGAHVAIHET
DIFLHKLFML DGLHMASPPQ VTLRDEIAAY ELRGLVVTLD