Gene Franean1_0331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0331 
Symbol 
ID5668755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp397246 
End bp398352 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content75% 
IMG OID641239262 
ProductNUDIX hydrolase 
Protein accessionYP_001504703 
Protein GI158312195 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.100097 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATC ATCAGGCGGC AATCGTCGCC CGGTTCGGCA CGTCCCGCCG GATGGAACCG 
GAGGAGGAAC TCCGGCTGGC CCGGCTGGCC GCCAACCCGG CACCCGCCCG GGACGCGGCG
ACCGTTGTGC TCCTGCGCGA CGCGCCGGCC GGTTCCGGCA TCGAGGCCTA CCTGCTCAGG
CGCACCCGGG CGATGTCGTT CGCGGGCGGC ATGCACGTCT TCCCGGGCGG GCGGGTCGAC
CCGTCGGACG CCGCAGAGGA TCTCCCGTGG GTCGGGCCCT CCGTCGAGGA GGCGATGCCG
GGGCTGGACG ACGATCCCGC CCGCGCGCGG GCCTTGGTCT GCGCGGCCGT CCGCGAGACC
TTCGAGGAGT GCGGCGTCCT GCTCGCCGTA CCGACGGGCA CATCCGCAGC CAGCAGAGTC
GCGCCGGCCG GCGGAGCCAC GCCGACGGGC AGGAACGTGC CGGCCGGACG TGTCGACGCG
GCGGGCGGCG CTGCCGGTAC CGGCGATCCG GGTACCGGCG ATCCGGGCTG GGCCGCCGAG
CGGCGGGCGG TGGAGAGCCA CCGCAGTGGC CTCGCCGAGC TGCTGACCCG CCGCGGGCTG
GCCCTGCGGG CCGACCTGCT GGCCCCGTGG ACCCGTTGGA TAGCCCCCGA GCTGGAGCCA
CGGCGGTACG ACACCAGGTT CTTCGTCGCC GCGCTGCCGG CCGGGCAGCT GCCGGGCGAG
CTCGCGACGG AACTCTCGAC CGAGGCTGAC GGGATGCTGT GGATCCGTCC GGCGGAGGCG
ATGGAGCGGT TTGTCGCCGG CGAGATCGGC ATGCTCCCGC CCACCGCCTT CACCCTCGCG
GAGCTGTCGG CCTACGACGA CGTCGCCGGT GCGCTCGCGG CCGCGCGCAC CCGCGACCTG
AAGCCGATCA TGGCAAGGAT CATCGCCGGC GACGGCACCT GGCAGCTGTC GTTCCCACAC
CTGTTACCGC TGGACGGTGC CCCCGGCACA CCGCTGGACG GTGCCCCCGG CACACCGCTG
GACGGTGCCC TCGGCACCGA GCCGGGCACG CCTTCGAAAA CCGCCCCGGC ACCGGCCGCG
GGCACCCGCG GTGGTGTCCC GCGGTGA
 
Protein sequence
MADHQAAIVA RFGTSRRMEP EEELRLARLA ANPAPARDAA TVVLLRDAPA GSGIEAYLLR 
RTRAMSFAGG MHVFPGGRVD PSDAAEDLPW VGPSVEEAMP GLDDDPARAR ALVCAAVRET
FEECGVLLAV PTGTSAASRV APAGGATPTG RNVPAGRVDA AGGAAGTGDP GTGDPGWAAE
RRAVESHRSG LAELLTRRGL ALRADLLAPW TRWIAPELEP RRYDTRFFVA ALPAGQLPGE
LATELSTEAD GMLWIRPAEA MERFVAGEIG MLPPTAFTLA ELSAYDDVAG ALAAARTRDL
KPIMARIIAG DGTWQLSFPH LLPLDGAPGT PLDGAPGTPL DGALGTEPGT PSKTAPAPAA
GTRGGVPR