Gene Franean1_2283 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2283 
Symbol 
ID5670682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2727364 
End bp2728785 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content75% 
IMG OID641241203 
Productamidohydrolase 
Protein accessionYP_001506624 
Protein GI158314116 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.442261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.362422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCCAC CACCGGCGGG TCCCGCTCCG CTGCCCGCCG ACTTCGTCGT GCGGGCCCGG 
CACGTCCTGA CCATGGGGCC CCGCGGCCAC CTGCGGGACG CCGCGGTCGC CGTCGTCGGT
GGGCGGATCG CCGCCGTCGA CACCGCGGCG GACGTCCGGG CGCGGTTCGC GGACCTGCCG
GTGGTCGGGG ACGGCGGCGG GATCCTGATC CCGGGGCTGG TCAGCGCGCA CGGGCACTTC
TCCGAGGGGC TCGTCACCGG CATCGGTGAG ACGCACACGC TGTGGGAGTG GTTCGTCCGC
GTCGTCGAGC CCATCGAGGG GCACCTCACC CGGGACATGG CCTACGTCGG GACGCTGCTC
AAGGCGGCCG AGCTGGCCTG TTCCGGGGTG ACGACGGTCG CAGACATGTT CTGCTCGGCG
GCCGGGGCCA CCCCGGTCAC CCCCGGGGTG GTCGACGCCC TCGACGCCGT CGGCCTACGC
GGCGATGTGT CGTTCGGCCC GGCGGACTCG GCGAACCCCC GGCCGGTCGC GGCCGTCCTC
GCCGAGCACG CGGCGCTCGC CGACGCGGCC CGCAACTCCC GCCGGACCAC CTTCCGGGTG
GGCCTGGCGA CCGTGCCGTC GAGCAGCGAC GAGCTGCTCG ACGAGACGGC CCGGCTGGTC
GCGCAGACCG GCCGGCTGCA CGTCCACCTG CACGAGATCC GCGAGGAGGT GACCGCGTCG
CGGACGACGC GCGGCACCGG GTCGATCGAG TTCGCCGCGC GGCGCGGGCT GCTCGACGCG
CAGGTCGTGG CCGCGCACTG CGTGTGGCTC GACGACACCG ACGTAGAACT GCTGCGCCGG
CACCGGGTCG CGGTCGCGCA CTGCCCCGTC TCGAACATGA TCCTCGCCAG CGGGGTGTGC
CAGGTCCCGC GGCTGCTGCG CGACGGGTTC ACCGTCGCGC TCGGCGTGGA CGGCGCGGCG
AGCAACGACA GCCAGAACAT GCTGGAGACG ATGAAAATCG CCGCCCTGCT GCAGAAGGTG
CACCACCTGC AGGCGACGGC CCTGACGGCG CCGACGGTGC TGCGGATGGC GACCATCGAG
GGTGCGCGGG CGCTCGGGCT CGCCGACGAG GTCGGCTCCC TGGAGGTCGG CAAGGCCGCC
GACCTGGTCT ACCTCGCCGA GGCGAGCCCG TCGCTGGCGC TCGTGCACGA CCCCTACCAG
GCGGTTGTCT ACTGTGCCTC CCCGCGGGAC GTCACCGGGG TGTGGGTGGC CGGTGAGCGG
GTCGTCGCCG ACGGGCGGCT GGTCGCCGTC GATCTCGGGC CGGTCCTGCC GTGGGCGCGT
GAGCTGGCCG TCGAGCTCGC CAGCCGGGCC GGGCTGGACT CCGAGCTGCG CTCCGCCGCG
GCCGGCCCGC CAGTGGAAGT GGTGCCCGGC GCGGCGCGGT GA
 
Protein sequence
MTPPPAGPAP LPADFVVRAR HVLTMGPRGH LRDAAVAVVG GRIAAVDTAA DVRARFADLP 
VVGDGGGILI PGLVSAHGHF SEGLVTGIGE THTLWEWFVR VVEPIEGHLT RDMAYVGTLL
KAAELACSGV TTVADMFCSA AGATPVTPGV VDALDAVGLR GDVSFGPADS ANPRPVAAVL
AEHAALADAA RNSRRTTFRV GLATVPSSSD ELLDETARLV AQTGRLHVHL HEIREEVTAS
RTTRGTGSIE FAARRGLLDA QVVAAHCVWL DDTDVELLRR HRVAVAHCPV SNMILASGVC
QVPRLLRDGF TVALGVDGAA SNDSQNMLET MKIAALLQKV HHLQATALTA PTVLRMATIE
GARALGLADE VGSLEVGKAA DLVYLAEASP SLALVHDPYQ AVVYCASPRD VTGVWVAGER
VVADGRLVAV DLGPVLPWAR ELAVELASRA GLDSELRSAA AGPPVEVVPG AAR