Gene Franean1_0074 details


Gene Information

Locus tag: Franean1_0074
Symbol:
ID: 5668499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 91442
End bp: 92632
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 67%
IMG OID: 641239002
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001504447
Protein GI: 158311939
COG category: [P] Inorganic ion transport and metabolism
              [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
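
The coordinates and lengths in this record agree with each other if the start and end positions are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the arithmetic, with illustrative function names that do not come from any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(91442, 92632)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Both values match the Gene Length and Protein Length fields listed above.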


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCGCA GCGAGCTCAT GGACCTCACC CGGCGCGCGC TCAAAGTCGC CACGGACCGG 
ACAACCGACA TGGAGCCGGC TGAACGCCGT CAGCCGGCCG ATGCCTACAC CAGCCAGGAA
AGTTTCGAAC GCGAACGCGA ACTCGTCCTC CGCTCGCCAC AGCTGGTCGG CTACCGCTCG
GAGCTGCCGA CCGCCGGAAG TTTCTGCACG AAGACGGTGA TGGACGTTCC CGTACTCCTG
ACCCGGAGCC AGGACGGCAC CGTCAGGGCC TTCCACAACA TCTGCGCTCA TCGCCAGGCA
CCGGTCGCGG TGCGCTGCGG CACAGCCGAA CGGTTCGTGT GCCCGTACCA CGCCTGGACG
TACGACACAC AGGGCCGCTT CGTCGGAGGA CCCGGCCGTG AGGGTTTCCC GTCGCTGACG
GCTGGCGGAA GCGGCCTCAC GGAACTGCCG GCCGCGGAGC ATGCCGGGTT TCTGTGGGTC
GGCCTCCAGC CGGAGAACGG GCCCCTGGAC ATCGAGGCCC ATCTGGGGCC GCTCGGCCCG
GAGCTTGCCT CGTGGGGGAT CGGCGACTGG TCGCTGGTGG GCGAGCGGGT ACTCGACTCT
CCGATCAACT GGAAGCTCGC TCTGGACACC TTCGCCGAGA GCTATCACTT CTCCACCCTG
CACCGCGGCA CGTTCGCCCA GCTCGCCCTG GGAAACTGCG CGCTTTTCGA CTCGTTCGGC
CCGCACCATC GGCTGGTCGT TCCGTTGCGG CACATCACGC GCCTCACGGA CCTTCCCGCC
GAAGAGTGGA AGCCGCTGGA CAACCTGTCG ATCGCTTATG CGCTGTTTCC TAACATCGTC
GTCTCGGTGA GTGCCGCCAA CAGCGAGGTC TTCCGGATCT ACCCCGGCAG CGGGCCCGGG
CACTCGGTGA CCTGCCACCA GAACGCCTCC GGGCTGGACC TCGAAGACGA GACCACCCGG
GCGGGTGCGG AGAGACTGTT CGACTTCGCG CACTCGACAG TCCGCGACGA GGACTATCAG
CTCGCTGTCG AGATCCAGAA GAACCTTTCC TCGGGCGCGC GATCCGAACT GGTCTTCGGG
CGCAACGAGC CGGGTCTTCA GCACCGTCAC TTCGTCCTCG ACGCAAGGAT GGGCCGTCAG
CCGCGGCCAG CCGGGCAGCC AGGGCGAGTG GACATTGTTC ATCGTTCATG A
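
The gene sequence is displayed in blocks of ten bases separated by spaces. The following is a minimal Python sketch of how such a display could be joined into one contiguous string and its GC fraction computed. `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first display line of the gene is used as sample input, so the printed value describes that line and not the whole gene.

```python
def clean_sequence(display_text: str) -> str:
    """Join a blocked sequence display into one uppercase string."""
    return "".join(display_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Sample input: the first display line of the gene sequence only.
first_line = "ATGGACCGCA GCGAGCTCAT GGACCTCACC CGGCGCGCGC TCAAAGTCGC CACGGACCGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two helpers would give a figure to compare against the GC content field of the record.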
 
Protein sequence
MDRSELMDLT RRALKVATDR TTDMEPAERR QPADAYTSQE SFERERELVL RSPQLVGYRS 
ELPTAGSFCT KTVMDVPVLL TRSQDGTVRA FHNICAHRQA PVAVRCGTAE RFVCPYHAWT
YDTQGRFVGG PGREGFPSLT AGGSGLTELP AAEHAGFLWV GLQPENGPLD IEAHLGPLGP
ELASWGIGDW SLVGERVLDS PINWKLALDT FAESYHFSTL HRGTFAQLAL GNCALFDSFG
PHHRLVVPLR HITRLTDLPA EEWKPLDNLS IAYALFPNIV VSVSAANSEV FRIYPGSGPG
HSVTCHQNAS GLDLEDETTR AGAERLFDFA HSTVRDEDYQ LAVEIQKNLS SGARSELVFG
RNEPGLQHRH FVLDARMGRQ PRPAGQPGRV DIVHRS