Gene Franean1_3229 details


Gene Information

Locus tag: Franean1_3229
Symbol:
ID: 5671604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3815700
End bp: 3817466
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 68%
IMG OID: 641242122
Product: X-Pro dipeptidyl-peptidase domain-containing protein
Protein accession: YP_001507542
Protein GI: 158315034
COG category: [R] General function prediction only
COG ID: [COG2936] Predicted acyl esterases
TIGRFAM ID: [TIGR00976] putative hydrolase, CocE/NonD family
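
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3815700, 3817466)
print(length, protein_length(length))  # 1767 588
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.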


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTACC GAGTCGAGAA GAACGTCATG GTTCCGATGC GGGACGACGT GACGTTATGC 
ACGGATCTCT TCCTGCCGGA AGGCGGGCCC GCGCCGGCGC TCATAATCCG GATGGGCTAC
AGCAAGGAGA TGTTCGAGAA GTTGTCCCTG CCGCTGATCC CGAACGTCCT CTCGCTGGTC
GAGGCCGGCT ACGCGATCGT TTACCAGGAG TGCCGCGGCA CCTACGGCTC CGGGGGCGTC
TTCCGGCCGC TCGTGGATGA CCCGGACGAC GGCGTCGACA CGCTCGAATG GACGGTGAAA
CAGCCGTGGT GCGACGGGAA CGTCGGCAGC TACGGGCTGT CATACCACGG CATGACCCAG
TGGGCGACGG CCTCACAGGC CCCCTCCGGG CTGAAGGCGA TGGCAGTGGC GGCCTCGACG
ACGGACCTCT TCCGCGCCCC GTGGTACAGC GACGGCGGTG CCGTGTCCTG GCAGATGACT
CTGGGCTGGG TGGCGGCCCA GATCGTCACG CTGGGCCAGT ACGCGCTCGA GCGCGGCACA
GGTGACCTCG AGCCGCTGGT CGACGCGGGC GCGATGATGC TCGACCTGGA GCCGCACCTG
CGCAAGCTCC CGATCACCGA TCAGCCAGCG CTGAACAAGC ATGCGCCCTG GTGGAAGGAA
TGGTGGGAGC ACCCCACCCG CGACGAGTTC TGGACCGGCC TGGCGACGGC CGAACACACC
CGGGACATGA CGACTCCGGC GCTGCACATC GTCGGCTGGT TCGACTTCTT CGCGCCCGAG
GCGACGCGTG CCTACACCCG AATGCGTGCC CAGGCAGCCA CGCCACAGGC ACGGGAGGGC
CAGCGGCTGA TCGTAGGCCC CTGGGACCAC ACCTACCAGG ATGCCGCCTA CCGGTCCCGC
GAGTTCGGCC AGCTGGCCGG CGCACCGTAC GCCGACATCA CCGGCGCGCA CCTGCGGTTC
TTCGACCGGC ACCTGCGCGG CAACAACAGC GCCGACGTCG GCGCAAGCCC AGTACGGATC
TTCGTGATGG GTGTGGACCA GTGGCGGGAT GAGCAGGACT GGCCACTGCC CGACACCACC
TACGTCGACT ACTACCTGGA CGGGCCCGGC CGCGCGAACA CCGCCGACGG CGACGGCGTG
CTCACGACCG AGGCCCCCAC CACCGAGGCG GCCGAGTCCT ACCGCTTCGA CCCGCTCGAT
CCGGTGCCGA CGCTGGGCGG CCGGCTCAAC CAGATGGGTT TCGGTTTCTC CGCTCTGTAC
TCGGGGCCGG TCGACCAGCG CCCAGTCGAA GAGCGCAACG ACGTTCTGTG CTTCACCACG
CCGGTGCTGG AGGAGCCGGT CGAGGTCACC GGGAACATAT CGCTGGTGCT GCATGCGTCG
AGCTCCGCGC TGGACACCGA CTTCACCGGC AAGCTCGTCG ACGTCCACCC CGACGGCCGG
GCGCTCTACC TGACCGACGG CATCCTGCGT GCCCGCTACC GCGAGTCGCT GGCCAACCCG
AAGCCACTCG TGCCCGGCGA GGTCTACGAG CTCATCCTCG ACCTCGGCCT GACCTCCAAC
GTCTTCCTGC CTGGCCACCG CATCCGGCTC GAGGTCTCCT CCAGCAACTT CCCGCGCTAC
GACCGGAACA CAAATACCGG CAACGTGATC TCCTTCGACA CGGCCACCCC GGTCGTCGCG
GGAAACCAGA TCCTCCACGG CCCGGCGCAT CCCAGCCGGC TCGTTCTGCC GATCATCCGG
CGCCTGCGGA CCGACCGACA CGGCTGA
 
Protein sequence
MSYRVEKNVM VPMRDDVTLC TDLFLPEGGP APALIIRMGY SKEMFEKLSL PLIPNVLSLV 
EAGYAIVYQE CRGTYGSGGV FRPLVDDPDD GVDTLEWTVK QPWCDGNVGS YGLSYHGMTQ
WATASQAPSG LKAMAVAAST TDLFRAPWYS DGGAVSWQMT LGWVAAQIVT LGQYALERGT
GDLEPLVDAG AMMLDLEPHL RKLPITDQPA LNKHAPWWKE WWEHPTRDEF WTGLATAEHT
RDMTTPALHI VGWFDFFAPE ATRAYTRMRA QAATPQAREG QRLIVGPWDH TYQDAAYRSR
EFGQLAGAPY ADITGAHLRF FDRHLRGNNS ADVGASPVRI FVMGVDQWRD EQDWPLPDTT
YVDYYLDGPG RANTADGDGV LTTEAPTTEA AESYRFDPLD PVPTLGGRLN QMGFGFSALY
SGPVDQRPVE ERNDVLCFTT PVLEEPVEVT GNISLVLHAS SSALDTDFTG KLVDVHPDGR
ALYLTDGILR ARYRESLANP KPLVPGEVYE LILDLGLTSN VFLPGHRIRL EVSSSNFPRY
DRNTNTGNVI SFDTATPVVA GNQILHGPAH PSRLVLPIIR RLRTDRHG
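
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it assumes the gene sequence is given 5' to 3' on the coding strand, strips the spaces that separate the 10-base blocks, and uses the standard codon assignments, which translation table 11 shares with the standard code for all non-initiator codons. `translate` and `CODON_TABLE` are hypothetical names introduced here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block separators and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGTTACC GAGTCGAGAA GAACGTCATG"))  # MSYRVEKNVM
```

Applied to the first three blocks of the gene sequence, this yields the first ten residues of the listed protein sequence. An alternative start codon such as GTG or TTG would need special handling as methionine, which this sketch does not implement; the gene here starts with ATG, so it is not affected.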