Gene Francci3_1769 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1769 
Symbol 
ID3903999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2104160 
End bp2105347 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content66% 
IMG OID637879107 
Productcytochrome P450 
Protein accessionYP_480874 
Protein GI86740474 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.266129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCC TACCCGACCT CGGTGATGCC GACGCCCTGC GGGACTGGCA CACCTCCTTC 
CGTGCGATCC GCGAACACCA ACGGATGTAC TGGGATGACG CCGTCGGCGC GTGGCTGGTA
ACTCGGCACG CCGACGTCGA GATGGGCCTG TACGACCACC GGCTGTCCTC ACAGGGCCCG
ACGTCATTCA TGGCCCAACT GTCCGCCGAA GACCTGGCGA AATTCGCCGA CCTGCAACGA
TTCTACGAAA GCTGGATGGT CTTCTCCAAC GAGCCTTACC ACACCGTTGT GCGCGGGTCG
GTGCAACGCG TGCTGACCCC GAGAGCGGTG CAGAAACGCC AAGAGGCGGT CCGCGCCGCG
GCGCGGTCAC TGTTGGACCG GGCCAGAGCG GAGGTCGTCG ACGTCAACAG CGACTTCGCC
AGACCGCTGG CCACGGCGGT GATCTCGGAG GTGCTCGGTG TTCCAGAGCA GGAGTGGGAC
AACTGCTCCC GCTGGTCGCA CCACATCATC GACTTCATCA GTGCGCCGCA GCCGGATGCG
TCGCGGGCCA TGGCTGCGGC GGAGTCCTAC GACCAGATGT GCGACTATGT CTACCACCTG
GTGGAGGAAC ATCGCCGCAC CGGCCGCGAC GACTCGCCGA TGCTGGCCGT GGCCGACGTC
GGCGCGCACG CGGTGGTGGG CACGTTCGCG CAGTTCATGA CCGGCGGCTG CGACCCGATC
TCGGCCGCGA TCGCCAACGC GGTGGCCACG TTGCTCGCCC ACCCCGACCA GATGCAGAGA
CTGGAGCGCG ACCGCTCACT GATCCCGACC GCGATAGAGG AGTTCATCCG TTACGAGTCC
CCATTCACCC TCGTGCCCAG AGTGGTGACC GAGCCGATGA CCGTGGCCGG GCAGCACCTC
CACGAAGGCT CTCGGGTACT GTTCATGCTG CTGGCCGCCA ATCGCGACCC TGGTGTGTTT
GAACGTCCGG ACGAGGTGGA CGTCGGCCGT TCACCCAATC CGCACCTGGG CTTCGGCAAA
GGCAGCCATT ACTGCATCGG CGCGGGCTTG GCCCGGCTGG AGATGACCGA GTCGATCGAG
GCGATCATCG ACATGGCGCC GAACCTGGAG TTGGCAGGCC AGGTGGAATG GTCCAGCAGC
TTGGGTCTGC GGTCTGCGGT GAAACTCCCG GTATCGGTGT CCCGATAG
 
Protein sequence
MATLPDLGDA DALRDWHTSF RAIREHQRMY WDDAVGAWLV TRHADVEMGL YDHRLSSQGP 
TSFMAQLSAE DLAKFADLQR FYESWMVFSN EPYHTVVRGS VQRVLTPRAV QKRQEAVRAA
ARSLLDRARA EVVDVNSDFA RPLATAVISE VLGVPEQEWD NCSRWSHHII DFISAPQPDA
SRAMAAAESY DQMCDYVYHL VEEHRRTGRD DSPMLAVADV GAHAVVGTFA QFMTGGCDPI
SAAIANAVAT LLAHPDQMQR LERDRSLIPT AIEEFIRYES PFTLVPRVVT EPMTVAGQHL
HEGSRVLFML LAANRDPGVF ERPDEVDVGR SPNPHLGFGK GSHYCIGAGL ARLEMTESIE
AIIDMAPNLE LAGQVEWSSS LGLRSAVKLP VSVSR