Gene Franean1_1428 details

Gene Information

Locus tag: Franean1_1428
Symbol:
ID: 5669833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1723883
End bp: 1725100
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 69%
IMG OID: 641240349
Product: cytochrome P450
Protein accession: YP_001505776
Protein GI: 158313268
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
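
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1723883, 1725100)
print(length, protein_length_aa(length))  # prints: 1218 405
```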


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGCTG CGCCGCAGGT TGTATTCGAT GCCTTTGCTC CTGTGTACCG ATCCAACCCC 
TACCCGCGAT ACGCGTTGGT TCGCGAATCC ACCGCCCTAT ACCCGATCAA TCCGCAGATC
GCCATGGCAA CGCGCTACGA GGAGTGCTCG GCGGTTCTCA CGGACGCGCT CTGGGGCCAT
GGCTACGAGG ACGGCATCAA CCCCTTCCGC CCCGGGGTGG ATCCCGACGA CGTCCCGGGC
TCGATGCTGC GGATGGACCC GCCGGACCAC ACCCGGATGC GGGGGCTGGT CAAGCGGGCG
TTCGTCCCGC GCCACACCGA AGGGCTCCGA CCACGGGTCG AAGGTCTCGT CAACGAACTG
ATCGACACCG CGATCGAGGC CGGCGAGGTT GACCTGATGG AGGCCCTGGC CCGGCCGTTG
CCACTCACCG TCATCGGGGA CATGCTCGGC ATCCCGCCGG AGGACTACAC CGCGGTCAAG
AAGTGGTCGC TGGAGATCGT CCGTGGCACG GACCCGGACA TCCTGCAGTC ACCCGAGAGC
CTGGCGCGTC GGCCTGAGGC GATGCGGGAG TTCGAGGCGT ACTTCGCCGG GCTGATCGCG
CAGCGGCGCA AGGACCCTCG CGACGACCTG CTGAGCGATC TCTGCGCGGC GCAGGAACGC
GACTCCGTGC TGAGCGACCG CGAGATGCTC GGGCTCTCCG TAGGGCTGCT GATCGGAGGC
TACGAGACCG TCTCCGACCT GATCGGCAAG GGCCTGGTGG CCTTGTTGCG CAACCCCGAC
CAGGTCGCCC TGTGGCGGTC CAACCCGGAA CTCGCCCCGT ACGCGGTCGA CGAGCTTCTC
CGCTACGAGC CGCCGGTGCA GTTCACCCAT CGGGTCGCGC TGGAGGAGCG GGAGCTCGCC
GGGCGCGCTT TCGCCCGGGG CGAAGGTGTC GTCGTCCTGA TCGCCGCCGC CAACCGCGAC
CCGGCCGTGT ACAGCGATCC TGAGCGCCTG GACATCACCC GGTTCGCCGG GCGTTCCCCC
GCGCCCCGCC ACCTCTCGCT CAGCGAGGGC ATCCACTACT GCCTCGGCGC TCATCTCGGG
CGGCTGCAGA CACAGATCGC GGTGGACACT CTCCTGCTCC GTGCGCCGGG GCTGTCGCTG
ACCGACGACG AACCCGTGTG GCGCGACACA GTCGCCATCC ACGGGCTGGA CACCCTCCCA
ATCCGCCTGC GGGACTGA
 
Protein sequence
MSAAPQVVFD AFAPVYRSNP YPRYALVRES TALYPINPQI AMATRYEECS AVLTDALWGH 
GYEDGINPFR PGVDPDDVPG SMLRMDPPDH TRMRGLVKRA FVPRHTEGLR PRVEGLVNEL
IDTAIEAGEV DLMEALARPL PLTVIGDMLG IPPEDYTAVK KWSLEIVRGT DPDILQSPES
LARRPEAMRE FEAYFAGLIA QRRKDPRDDL LSDLCAAQER DSVLSDREML GLSVGLLIGG
YETVSDLIGK GLVALLRNPD QVALWRSNPE LAPYAVDELL RYEPPVQFTH RVALEERELA
GRAFARGEGV VVLIAAANRD PAVYSDPERL DITRFAGRSP APRHLSLSEG IHYCLGAHLG
RLQTQIAVDT LLLRAPGLSL TDDEPVWRDT VAIHGLDTLP IRLRD
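
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that, compute GC content, and translate the gene with translation table 11 (the table listed in the record). It is a rough illustration and not the database's own pipeline: the helper names are hypothetical, the codon table is built from the standard NCBI base ordering, and the first codon is read as methionine when it is one of the table 11 start codons.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str, initiator: bool = True) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        # An alternative start codon such as GTG is read as Met at position 0.
        if pos == 0 and initiator and codon in START_CODONS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("GTGTCCGCTG CGCCGCAGGT TGTATTCGAT"))  # prints: MSAAPQVVFD
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence once its spaces are removed, and `gc_percent` can be compared against the GC content field.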