Gene Francci3_1111 details


Gene Information

Locus tag: Francci3_1111
Symbol:
ID: 3905453
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1325632
End bp: 1326669
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 68%
IMG OID: 637878443
Product: cytochrome P450
Protein accession: YP_480220
Protein GI: 86739820
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
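
The coordinate and length fields in this table are consistent with each other, and the relationship can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names gene_length and protein_length are illustrative only and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 1325632
END_BP = 1326669


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1038 345
```

Both results match the Gene Length and Protein Length fields reported for this record.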


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.20408
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.31292
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGAACG CCCACTCAGA CCTCCGCATG AGCTCGCGTG GCGACCTTCT GCGATCGCCC 
ATCCCGCTGC CGATGGCAGG CGAACGCACC GAGCCGGCGC CGGGCATGTT CACCGCGATG
GATCCTCCGG AGCACACCCG CTACCGGCGG CATATCGTCG AGTGGTTCTC GACGCGTCGG
ACCAGGTCCC TGGAGCCGCG GGTGATCGAG ATCGTCGACC AGCACCTGGA CGCGATGATC
GCCGGTGGCG GTCCGGCCGA CCTGGTGTCG GCGTTCGCCG AACCGGTGTC CGCGCTGGTG
ATTTGCGAAC TGCTGGGAGT GCCCGTCGAG CAGCGTGAGG TGTTCGGCGC GGCTATCGCA
GCGTTGTTCA CCGTTCACTC CAGCGCGGAG GAGGCCATCA GCGGCTGGCA GAACATCGGT
GGACTGCTGA TGGGTCTCAT CCAGGCCAAG CGGGTCGCAT CGGCGGACGA CCTGCTGGGC
ACGCTGGTGG CCCGGGGTGA GCTCAGCGAC GAGGAGCTGA TGACGATCGG CAGTGTGCTG
CTGGTCGCCG GGCACGACAC CAGCACCAAC ATGATCGCGA TGGGAACGTT CGCGCTGCTG
GAACATCCCG AGCAGTATGC GGCGCTGGCC GCGGACCCGG GCCTGGCGCC CGGTGCTGTC
GAGGAACTGC TTCGCTACCT GACGATCGTG CACGCCGGCT CGATCCGTGC GGTCTCCGCC
GATCTGGAGT TCGACGGCCA CCAGCTGACC GCGGGTGACG CGGTGTCGCT CTCCCTCGCC
GCGGCCAATC GGGACCCCGC ACTGTGCGAC GCTCCCGACC GCCTCGACAT CACGCGCGAG
CCCGTGCCGC ACCTGGCCTT CGGGCACGGC ATCCACCAGT GCGTCGGACA GCAGCTATCC
CGATTGGAGC TGCGCATCGC GTTCGAATCG TTGGCGCGCC GCCTGCCGAA CCTTCACGTG
GCGGTGCCGA CCAGCGAGAT CCGTACCCGC TCCGAAATGA TCATCTACGG TGTGCGAGAG
CTGCCGGTGA CCTGGTGA
 
Protein sequence
MQNAHSDLRM SSRGDLLRSP IPLPMAGERT EPAPGMFTAM DPPEHTRYRR HIVEWFSTRR 
TRSLEPRVIE IVDQHLDAMI AGGGPADLVS AFAEPVSALV ICELLGVPVE QREVFGAAIA
ALFTVHSSAE EAISGWQNIG GLLMGLIQAK RVASADDLLG TLVARGELSD EELMTIGSVL
LVAGHDTSTN MIAMGTFALL EHPEQYAALA ADPGLAPGAV EELLRYLTIV HAGSIRAVSA
DLEFDGHQLT AGDAVSLSLA AANRDPALCD APDRLDITRE PVPHLAFGHG IHQCVGQQLS
RLELRIAFES LARRLPNLHV AVPTSEIRTR SEMIIYGVRE LPVTW