Gene Franean1_3276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3276 
Symbol 
ID5671649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3881847 
End bp3883157 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content66% 
IMG OID641242167 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001507587 
Protein GI158315079 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTGGAG TAGTGTCCAG CATCCAGGAA CGTCTTGCGA CCGGTAGAGG CAAGTACACA 
CCGGGGTACC CGAACCTCGA CACCGGGCCG GTCGACTACG AGGACTCGAT CTCCGAGGAG
TTCTTCCAGG CCGAGCGCGA GGCGATCTTC AAGCGGACCT GGCTGAAGGT CGGCCGGATG
GAGCAGCTGC CCCGCAACGG CACGTTCTTC ACCCGCGAGT TCGTGGGCCT GGGGTCGATC
GTGATCACCC GGCACACCGA CGGCGAGGTG TACGCGCTGC ACAACATCTG CGCGCACCGC
GGCAACAAGG TCGTCTGGCA GGAGCACCCG ACCAACGAGA CCCAGGGCAG CGCCCGGCAG
TTCGCCTGCA AGTACCACGG CTGGCGCTAC GGCCTCGACG GCAAGTGCAC CTACGTCACC
AAGCGGAACG AGTTCTTCGA GTCGCTGCCC GACGACGAGC TCGCCATGCC GCAGCTGCGC
TGCGAGGTCT TCGCCGGGTT CATCTTCGTG AACTTCAGCC AGGACGCTCC GCCGCTGCGC
CAGTTCCTCG GTGAGAAGCT GGCCACCGAG CTGGAGAGCT GGCCGTTCGA GAAGTTCACC
AACCACTGGT CCTTCCGGAC GAAGGTCAAG GGCAACTGGA AGATCGGCAT CGACGCGCTG
CTGGAGTGGT ACCACCCGGC GTACGTCCAC GGGCGGTTCC TCAACACCAA CGTGGCCGAG
GCGGAGAAGC TCGTCCCGCC GATGGACTCC TACCATTACG ACCTGTTCAC CCCGCACATG
CTGACCTCGG TGCCCGGCCC GCCGCTGCTG AAGAAGAAGC AGGGCTCGGT CGGCCCGGCC
AAGCGGGACA TGAACTGGGC CTACCGGCTG TTCCGCGCCG GCCTGTTCGG CCCGGACGAC
GTCCGCGAGG ACCTCGGCCA CCTCACCCCG GACCGCAACC CCGGCAACGT CCAGTCCTGG
AGCAACGACC AGTACTGGCT GTTCCCGAAC CTGTCGGTCC AGCTCTGGGG CCGCGGGTAC
TACATCACCT ACCAGTACAT CCCGGAGACG GTGGGCACCC ACGCCTACGA GGTCGACATC
TACTTCCCGG AACCGAAGAC CGCCTCCGAG CGCCTCGCCC AGGAGCTCGT CGTCGACAGC
ACCATCGAGT TCGCGATGCA GGACACGAAC ACGGTGGAGG CGACCTGGTC GCAGCTCAAC
AACCGCGCGC TGCAGACGTT CCACCTGTCC GACATGGAGC TGATGATCCG TCAGTTCCAC
AAGGTCGTCC GGGACGCCGT CGCGGCGCAC CAGGCCGGCA GCGAGAAGTA G
 
Protein sequence
MGGVVSSIQE RLATGRGKYT PGYPNLDTGP VDYEDSISEE FFQAEREAIF KRTWLKVGRM 
EQLPRNGTFF TREFVGLGSI VITRHTDGEV YALHNICAHR GNKVVWQEHP TNETQGSARQ
FACKYHGWRY GLDGKCTYVT KRNEFFESLP DDELAMPQLR CEVFAGFIFV NFSQDAPPLR
QFLGEKLATE LESWPFEKFT NHWSFRTKVK GNWKIGIDAL LEWYHPAYVH GRFLNTNVAE
AEKLVPPMDS YHYDLFTPHM LTSVPGPPLL KKKQGSVGPA KRDMNWAYRL FRAGLFGPDD
VREDLGHLTP DRNPGNVQSW SNDQYWLFPN LSVQLWGRGY YITYQYIPET VGTHAYEVDI
YFPEPKTASE RLAQELVVDS TIEFAMQDTN TVEATWSQLN NRALQTFHLS DMELMIRQFH
KVVRDAVAAH QAGSEK