Gene Franean1_3241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3241 
Symbol 
ID5671616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3830032 
End bp3831795 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content70% 
IMG OID641242134 
ProductX-Pro dipeptidyl-peptidase domain-containing protein 
Protein accessionYP_001507554 
Protein GI158315046 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.972915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCGCCG AAGAGGAGCT GATCGTCGAG AAGGGCGTCA TGGTTCCGAT GCGCGACGGT 
GTGCGCCTGT ACACGGACAT CTACCGGCCG GCCGGTCCCG GGCCGTTTCC TGTGCTTGTG
TCCCGTACCG CCTACTGGCT CAACGGCGGG GTCACCCAGG GCCTGAGCGG TTTCGCCAAA
CTGGTCGCCC GGCAGGGCTA TGTGGCCGTG TTCCAGCAGA GTCGCGGCCG GTTCGCTTCC
GAGGGTGTGT TCCACCCGGC GCTGTGCGAC GTGGACGACG GGTACGACGT GGTGGAGTGG
GCGGCCGCCC AGCCGTGGTC CACCGGCAAG GTCGGTATGT TCGGCGGGTC GTACCAGGGG
CTCACCCAGT GGGCGGCCGC CATCGCCCGG CCACCGCATC TCGCGTGCAT CGCGCCGTTG
ACGTCCACCT GGAACACCTT CGGCAATGAG ATCTGGTACG CCGCGCCGGG CGTCCTGTCG
CTGGGCAGCG CGTTCGCGTG GGCGTGGGGC GCGGTGCTCG GCGAGGCCGA GCGCCGGGGC
GTTCCGGCGC CGGAGGGCGC GATCCACCAC GGTGAGGAGG GCAACGAGCC CGGGGACGTC
GCGGAGGCCA TCGCGAAGCG CACCGTGGCG ATGATGGAGA TGTACGCCTT CCGCCCGCTG
CGCGACGCGC CCCAGCTCGA GCTGGTGTCG TGGTGGAAGG ACTGGTGCGA CAACGGCGAC
CCGAACGACC CGTACTGGCT GGTGGTCAAC GCCTCGGAGC ACGCCGTCGA CCTCGACCTG
CCGATCTTCC AGGCGAGTGG CTGGTACGAC ATGTTCCTCA ACGGGACGCT CGAAGCCTTC
CAGGCGCTGC GCGGCGCGGG CGCCACCCAG TACGTCCGGG ACAACCAGGA GTTGGTCATC
GGACCGTGGA ACCACGGCGG CGTGTGCCCG CCCCGTCCGG ACGCACCGGC GGACACCGGG
CCCCTGGGGC TCTGGGACCT CTCCGAGGGT TCCGCCTGCA TGGAGTTCTT CCGCCGTCAC
CTGAAGGGCG AGCAGGTCCT GGACCCCGCG CCGGTCCGCC TGTTCGTCAT GGGCGAGAAC
ATCTGGCGCG ACGAGCGGGA ATGGCCGCTG GCCCGCACCC GCTGGACGCC CTACTACCTG
CACAGTGCCG GCGGGGCCAA CACCGCCGCG GGCGACGGCT CGCTGTCCAC GGAGCGTCCC
GGTGACCAGC CGGACGACGT CTTCGTCTAC GACCCGCAGA ACCCGGTCGT GTCGCAGGGC
CGGCTGGAGT GCTACGCCCC CGACCACGGC GCCGAGACCG CCCGCAACGA GAGCCGCGAC
GACGTTCTCG TCTACACCAC GTCACCGCTC GAGCATGACC TGGAGGTCAC CGGCCCGGTG
ACGCTCGAGC TCTGGGCGTC GTCGTCCGTC TCCGACACCG ACTTCACCGC GAAGCTCGTC
GATGTCTTCC CGGACGGCGC GGCCATACCG CTCGCCGAGG GCGTGGTGCG GACAGGCGTG
GCGTTCACGC AGCCACCGCG GCCCGAGACG CCGCGTCGTT ACCGAATCAG TCTCTGGGCG
ACGAGCAACG TCTTCAGATC CGGTCACCGT ATCCGGCTCG ACGTCTCGTC GAGCGCGTTC
CCGGAGTACG AGCTGAACCC GAATACCGGT CAGCGGATCA CCCACGACGC CACCGGGAAG
ACGGTGCCCG CGACGCAGCG GGTGCACCAT GACCGCCGCT TTCCTTCACG CCTTGTCCTG
CCGGTGATCC CGCGGCCCGC GTAG
 
Protein sequence
MIAEEELIVE KGVMVPMRDG VRLYTDIYRP AGPGPFPVLV SRTAYWLNGG VTQGLSGFAK 
LVARQGYVAV FQQSRGRFAS EGVFHPALCD VDDGYDVVEW AAAQPWSTGK VGMFGGSYQG
LTQWAAAIAR PPHLACIAPL TSTWNTFGNE IWYAAPGVLS LGSAFAWAWG AVLGEAERRG
VPAPEGAIHH GEEGNEPGDV AEAIAKRTVA MMEMYAFRPL RDAPQLELVS WWKDWCDNGD
PNDPYWLVVN ASEHAVDLDL PIFQASGWYD MFLNGTLEAF QALRGAGATQ YVRDNQELVI
GPWNHGGVCP PRPDAPADTG PLGLWDLSEG SACMEFFRRH LKGEQVLDPA PVRLFVMGEN
IWRDEREWPL ARTRWTPYYL HSAGGANTAA GDGSLSTERP GDQPDDVFVY DPQNPVVSQG
RLECYAPDHG AETARNESRD DVLVYTTSPL EHDLEVTGPV TLELWASSSV SDTDFTAKLV
DVFPDGAAIP LAEGVVRTGV AFTQPPRPET PRRYRISLWA TSNVFRSGHR IRLDVSSSAF
PEYELNPNTG QRITHDATGK TVPATQRVHH DRRFPSRLVL PVIPRPA