Gene Franean1_3319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3319 
Symbol 
ID5671691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3930746 
End bp3932089 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content66% 
IMG OID641242208 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001507628 
Protein GI158315120 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.65534 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTCA CCGACGACCG CCGGGCCGAC GCTGCCCGCG ACGTCCCGTT CGCCATGCGG 
AACCCGTTAC ACGTACCACG TGAGCGTTAT TATGACCGAA CCTTCTTCGA GCTGGAGAAG
GAGTATCTGT GGCCGCGCGT CTGGCAGATG GCCTGCCGGC TGGAGGAGAT CCCGCGGCCC
GGCGACTTCG TCGAATACGA GATCTGCGAC CAGTCGATCC TGGTCGTGCG GCAGCCCGAC
CTTTCGGTCA AGGCCTTCCA CAACGCCTGC CGACACCGGG CTACCCAGCT GTGTTCGGGT
TCCGGGCGGC TGCGTGGTGG GCAGATCGTG TGCCCGTTCC ACGGCTGGCG CTGGAACCTC
GACGGCAGCA ACTCGTTCGT GTACGGCGCC GACGGATTCG CGCCGGAGAC CCTGCGTCCG
GACGACATTC GGCTGCGCGA GTGCAGGGTC GACACGTGGG GTGCCTGCGT GTGGATCAAC
ATGGATCCGG ATGCCCGTCC GTTGCGGGAG GCGCTGGCGC CGGTAGCGGG GTTGCTGGAC
GCGGTGTGCG TGGAGAACAT GCGGGTGTGG TGGTGGAAGG AGACGATCCT CAACGCGAAC
TGGAAGGTGG CTCAGGAGGC GTTCCACGAG GGTTACCACG TGATGGGCAC GCATCCGCAG
CTCACGTTCG GTCTGGGTGA CGACTATCCG TTCGGGAATG TCGAGTACAC CGCGTTCGGG
AACGGCCACG GTCGTTTCCA GGGCCGGTTC GACCCGACCG CGGGCGGCGT CTCCCAGGGG
CGTGGTGGGG AGGCGTTCCT GGAGCGGTCG CGGATCCTGT GGGAGGGGCA GGACGCGATG
ACCCTCGAAC GTGACCTGCA CGTCTTCCGG GGGATGCGTA ACCGGGTCGC GCCCGGTGAG
GACTTCCCGA CGGCGGCGAT CAAGGCGTTG TTCGACTACG CCGAGGGTGC CGGTATCCCG
CTGCATCCGA CACCGGAGGG CCTGCGGCTG TGGGGTGGCG GGGTTTTCCT GTTCCCGAAC
TTCCTCATGC TTCCCCAGTT CGGTAACGCG CTGTCGTACC GGGTCCGCCC CTACAACGAC
GACCCCGAGT GGTGCCGTTT CGAGGTGTGG TCGTTGACCA TGTACCCGGA GGGCGAGGAG
CCCGGCCGGG CGAAGCTGAA GGGCCGCTTC GCCTCCGACG ACACCGAGAA CTGGGGTCTG
ATCCCACGCC AGGACTTCAG CAACATCGCA AGCCAGCAAC GCGGCCTGCA TTCGCGCAGC
TACCGCGAGC ACCGTCTCGC CACCGAGATG GAGCAGCTCA TCAGCAACAT GCACTCCGAA
CTCGACCGGT ACCTCGCCGG CTGA
 
Protein sequence
MAVTDDRRAD AARDVPFAMR NPLHVPRERY YDRTFFELEK EYLWPRVWQM ACRLEEIPRP 
GDFVEYEICD QSILVVRQPD LSVKAFHNAC RHRATQLCSG SGRLRGGQIV CPFHGWRWNL
DGSNSFVYGA DGFAPETLRP DDIRLRECRV DTWGACVWIN MDPDARPLRE ALAPVAGLLD
AVCVENMRVW WWKETILNAN WKVAQEAFHE GYHVMGTHPQ LTFGLGDDYP FGNVEYTAFG
NGHGRFQGRF DPTAGGVSQG RGGEAFLERS RILWEGQDAM TLERDLHVFR GMRNRVAPGE
DFPTAAIKAL FDYAEGAGIP LHPTPEGLRL WGGGVFLFPN FLMLPQFGNA LSYRVRPYND
DPEWCRFEVW SLTMYPEGEE PGRAKLKGRF ASDDTENWGL IPRQDFSNIA SQQRGLHSRS
YREHRLATEM EQLISNMHSE LDRYLAG