Gene Franean1_3231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3231 
Symbol 
ID5671606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3818821 
End bp3820425 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content69% 
IMG OID641242124 
ProductX-Pro dipeptidyl-peptidase domain-containing protein 
Protein accessionYP_001507544 
Protein GI158315036 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTCA ACCTGATCAG CCGGCTGTTA CAGCGAACCC AGCAGTTACC ACCGCCACTC 
ACCCGCGATC TCACCGTCGC ACGCGGCCTG CGGGTGCCGA TGCGTGACGG CGTCGAGCTG
ACCGCCGACC ACTGGTTCCC CAGGGCCGGC GCCGCGGGGC TGCCGACCGT GCTGATCCGC
ACCACCTACG GCAGCCACAG CTCAGCCACG TACCCGATCG TGCGGCCCAT CGCCGAGCGC
GGGTTCCAGG TGCTGATCAC CAACTCGCGC GGCACCTTCG GCTCCGGCGG CGCCTTCGAC
CCGTTCCGCA ACGAACGCGA CGACGGCTTC GACACCCTCG ACTGGGTCAT CGGACAACCC
TGGTTCGGCG ACTCCATCGT GCTCTATGGA CCCAGCTACC TCGGCTACAC CCAGTGGGCC
GTCGCCGATC AGGTGCCGCC GCAGGTCAAG GCCATGATCC CGATTCAGAG CGAAGCCGCG
GTCATGCTGG AATTCCTGCG CCCGGACGGC TTCGCGCTGG AGATCCTGTT CATCTGGAGC
TTCGTGGTGG ACGGGCAGGA AAGCCCGCTG GCTCTGCTCA GACACCCCGC CCTGGGCGGT
AGGCGGAAGA TGCGCCGACT GATGGCCAGC CTGCCGCTCG AGCAGGCCGA CCTGCGCGGC
GCCGGCCACC GGATCGACTA TCTGCAGAAC ATCCTGGCCC ACGATGCCGG CTCACCGCAC
TGGGCGCCTG CCGACCACAG CGCCCGGGTC GCCGACGTGA CGATCCCGGT CAGCTCCATC
GCCGGCTGGC ACGACTTCTT CCTGCCCGGC CAGCTACGTG ACTTCACCGC CCTGCAGGCC
GCCGGCCGCC CGGCCCGGCT GACCGTCGGA CCGTGGGCGC ACAGCATGTC CGCCGGGCCC
ATCAGGCTGG GCATGGAGGA ACTACTCGAC TTCGGCCTGG CCCACGCCCG CGCGGAGCAA
CCGACCGACC GAGCCCCGGT CCGTCTGTTC GTGCAGGGCG CCGACGAATG GCGGGACTTC
CAGTCCTGGC CCCCGGAAGG CTACCCACAG CAACGTCTGC ACCTGCAACC GGGCGGCGGC
CTGGCGACAG CGAGCCCGGC GGACTCGCCC CCGGACAACT ACCGCTACGA CCCCGCCGAC
CCGACTCCCG CCGTCGGTGG ATCGCGCTTC AACGTCAACA CCGGCAGCGT CGACAACACC
GCTCTCGAGG CCCGGGCGGA CGTGCTGACC TTCACCACAC TGCCGCTGGA CCGTGAGGTC
GAGGTGATCG GTGAGGTCGA CGCCGAGATC TGGTTCCGGT CCAGCCTGCC GTACGCGGAC
GTGTTCGTCA GGGTCTGTGA CGTCAACACC GGCGACCGCT CCTACAACGT CACCGACGGC
CTGACCAGTC TCACCGAGGC GGACCAGGAC ACCCGGGCGA GGGTCCGGCT CCCCGCCACC
GCGTACCGGT TCAGGAAGGG CCACCGCATC CGTGTCCAGA TCTCCAGCGG CGCTTTTCCC
CGGTACAACC GCAACCCCGG CACCGGAGAA CCCCGCGGCA GCGGACAACT CTCAACGCCG
CCAGTCAGAC CATTTACCAC GACCCGGCCC GCCCGTCAGC GGTGA
 
Protein sequence
MTLNLISRLL QRTQQLPPPL TRDLTVARGL RVPMRDGVEL TADHWFPRAG AAGLPTVLIR 
TTYGSHSSAT YPIVRPIAER GFQVLITNSR GTFGSGGAFD PFRNERDDGF DTLDWVIGQP
WFGDSIVLYG PSYLGYTQWA VADQVPPQVK AMIPIQSEAA VMLEFLRPDG FALEILFIWS
FVVDGQESPL ALLRHPALGG RRKMRRLMAS LPLEQADLRG AGHRIDYLQN ILAHDAGSPH
WAPADHSARV ADVTIPVSSI AGWHDFFLPG QLRDFTALQA AGRPARLTVG PWAHSMSAGP
IRLGMEELLD FGLAHARAEQ PTDRAPVRLF VQGADEWRDF QSWPPEGYPQ QRLHLQPGGG
LATASPADSP PDNYRYDPAD PTPAVGGSRF NVNTGSVDNT ALEARADVLT FTTLPLDREV
EVIGEVDAEI WFRSSLPYAD VFVRVCDVNT GDRSYNVTDG LTSLTEADQD TRARVRLPAT
AYRFRKGHRI RVQISSGAFP RYNRNPGTGE PRGSGQLSTP PVRPFTTTRP ARQR