Gene Franean1_3952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3952 
Symbol 
ID5672313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4725004 
End bp4726656 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content77% 
IMG OID641242831 
ProductPfaD family protein 
Protein accessionYP_001508248 
Protein GI158315740 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID[TIGR02814] PfaD family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0190583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0375057 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGACCC CGGTCGCGCC CGCCCCCCGC CCCGCGGCGG CTCAGCCGGC CACGGCCCAC 
CCCGGCGCGG CCGGGGCACT CGCCCGGCGG CCGGCCGACA TCCACCGGGT GCTCGCCGAT
CTGGAGCGGC CCTGCTACGT CGTGCAGGAC GCGGCGGGGC TCGGCGTCAC CGGCGACGAG
CGGGTCGCCG CGGCGGCCGG GCGGGTCCTG GCCGCGGTCG GCCCGCTGCC GCCGGAGCGC
CTCGGCGCCC CGGCGTTCCG CCGCGACCAC CAGGTCGCCC AGGCGTACAT GGCCGGGTCG
ATGGCTAACG GCATCGCGTC GGCCGACCTG GTCGTCGCGC TGGCCCGGGC GGGCTTCCTG
GCCTCGTTCG GCGCGGCCGG CGTCGTCGCC GCCCGGGTCG ACGACGCGCT GGCCGACATC
CGCCGGCGGG CACCGGGCCT GGCGTTCGGC TGCAACCTGA TCCACAGCCC GACCGAGGCG
GCGATGGAGC GCGACGTCGT CGACGCCTGC CTGCGCCACC AGGTGCGCTG CGTCGAGGCG
TCGGCGTTCC TGGACCTCAC CCCGCAGGTG GTGCGCTACC GGGTCGCCGG GCTGCGCCGC
GGCCCCGACG GCCGGGCCGT CGCCGACAAC CGCGTCGTCG CCAAGGTGTC GCGCACCGAG
GTCGCCGAGC TGTTCCTGCG GCCCGCGCCG GCCGCGCTGG TCCGTCCGCT GGTGGAGCAG
GGGCTGGTCA GCGCCGAGCA GGCGGAGCTG GCGGCGACGG TGGCGATGGC CGACGACGTC
ACCGCCGAGG CCGACTCCGG CGGGCACACC GACCGCCGGC CGCTGCCCGT CCTGCTGCCC
GAGCTGCTCG CGCTGCGCGA CAGCCTGCGC GGCGAGTCCG GAGGGCGCAC CGTGCGGATC
GGGGCGGCCG GCGGGCTCGG CACCCCGCGC GCCGTCGCCG CCGCGTTCAC GCTCGGCGCG
GACTACGTGG TGACCGGCTC GGTCAACCAG GCCTCCGTCG AGGCGGCGCA GTCGCCGGCC
ACCAAGACGC TGCTGGCCCA GGCGGGCGTC ACCGACTGCG TCCAGGCGCC GTCGGCGGAC
ATGTTCGAGA TCGGCGCGGA CGTCCAGGTG CTCGGCCGGG GCACCATGTT CGCGTCGAAG
GCGCGCCGGC TCTACGAGCT GTACCGCCGC TTCGACGGGC TCGACGAGAT CCCGGCCGAC
GAACGCGCGG GCCTGGAGCA GCGGATCTTC CGCCGCTCGC TGGACGAGGT CTGGGCCGAC
ACCGTCTCCT ACTTCAGCAC CCGCGACCCC GAGCAGATCG AACGCGCCCA GGAGAACCCG
AAGCGGCGGA TGGCGCTGGT CTTCCGCTGG TATCTCGGGC TGTCGTCGGG CTGGAGCATC
GCCGGTGCCC CGGACCGGGT CGCCGACTAC CAGGTCTGGT GCGGCCCGGC GATGGGCGCC
TTCAACACCT GGGTGCGCGC CAGCGCGCTC GAGCCGCTGG CGAACCGGCA CGCCGCCGTG
ATCGCCGCCG AGCTGATGCG CGGCGCCGCG TTCACCAGCC GGGCCGCCGC GCTGGCCCAG
GCCGGCGTCC GGCTGCCGGC GCTCGCCACG ACCTACGTGC CGCGGCCGCA CCTGCCGCGG
CCAGATTCCG ATCCGGGAGA GCAGGCACCA TGA
 
Protein sequence
MVTPVAPAPR PAAAQPATAH PGAAGALARR PADIHRVLAD LERPCYVVQD AAGLGVTGDE 
RVAAAAGRVL AAVGPLPPER LGAPAFRRDH QVAQAYMAGS MANGIASADL VVALARAGFL
ASFGAAGVVA ARVDDALADI RRRAPGLAFG CNLIHSPTEA AMERDVVDAC LRHQVRCVEA
SAFLDLTPQV VRYRVAGLRR GPDGRAVADN RVVAKVSRTE VAELFLRPAP AALVRPLVEQ
GLVSAEQAEL AATVAMADDV TAEADSGGHT DRRPLPVLLP ELLALRDSLR GESGGRTVRI
GAAGGLGTPR AVAAAFTLGA DYVVTGSVNQ ASVEAAQSPA TKTLLAQAGV TDCVQAPSAD
MFEIGADVQV LGRGTMFASK ARRLYELYRR FDGLDEIPAD ERAGLEQRIF RRSLDEVWAD
TVSYFSTRDP EQIERAQENP KRRMALVFRW YLGLSSGWSI AGAPDRVADY QVWCGPAMGA
FNTWVRASAL EPLANRHAAV IAAELMRGAA FTSRAAALAQ AGVRLPALAT TYVPRPHLPR
PDSDPGEQAP