Gene Franean1_6188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6188 
Symbol 
ID5674509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7517426 
End bp7519045 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content74% 
IMG OID641245040 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_001510438 
Protein GI158317930 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.104121 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCGGA ATCTTCCATT GGACAGGGTC GTCGGCGGGC CCGTCCCGGC GCGGCGGCGG 
CCGGCGGGCT GCCACGCTGG TGGCATGGAT GCGGTGACGC TGTCGGTGGT GTCCAGTGCG
CTGGGCGGCA TCGCCGAGGA GATGGGGACG CTGCTCGTCC GGAGCGCGTA CTCGTCCAAC
ATCAAGGAGC GGCGCGACTG CTCGGCGGCG ATCTTCGACG CGGACGCCCG GATGATCGCT
CAGGCCGCCC ACGTGCCGGT GCACCTGGGC GCCATGTACG AGTCGGTCCG GGTGGTCGCC
GAGCGCGGGC CGCTGCCTGG TGACGTGTGG GTGCTCAACG ACCCGTTCTC CGGCGGGAAC
CACCTGCCCG ACGTCACGCT GATCTCCCCG CTCGCGTTTG ACGGCGAGGT GGTCGGCTAC
GCGGCCACCC GTGCGCACCA CTCCGACATG GGCGGGATGC GTCCCGGGTC GATGCCCGCG
GACTCCACCG AGATCTTCGC CGAGGGCCTG ATCATCCCGC CGGTCCGGCT CGTCCGGGCG
GGGCGCTGGG AGGAGGACCT GCTCGACCTC ATCTGCGCGA ACTCGCGGAC GCCCGACCTG
CGCCGCGGTG ACCTGCGCGC GCAGGCGGCG GCGAACGGGC TGGCCGCCAC CCGGCTGGCC
GAGCTCGCGC GCCGGCACGG CACCGGGCTC GTCCTGGAGT CGTTCGCCGA GGTACTCCGC
TACGGCGAGC GGCGCAGCCG GGCAGTGATC GCGCGGCTGC CCGACGGCGT CCACCGGGTG
GAGAGCGAGC TCGAGGGCGA CGGCGTCGAC GACCGGGACA TAGCGCTGCG GGTCGCGGTG
ACCATCGAGG GCGACGGCAT CACCGTCGAC TTCACCGGCA CGTCACCGGC GGTGCGCGGC
AACGTCAACT GCCCGCGCGC CGTGACCCGC TCCGCGTGCT GCTTCGCGCT GCGGGTGCTG
CTGCCCGACG ACGTCCCGAC CGGCGACGGC ACCTACGCCC CGCTGACCGT GGTGACCGAG
CCCGGTTCGC TGGTCGACGC GCAACGCCCG TCCGCCGTCG TGGCGGGCAA CGTGGAGACA
TCGCAGCGCA TCGCCGACAC GGTGCTGGCG GCCCTGCGGC TGGCGGTCGG CGCCGGGCCG
GACGTCCTGG CGGCGCCCGG GCAGGGGACG ATGAACAATC TGGTGATCGG CGGCGCGACC
TGGACGTACT ACGAGACGCT GGGCGGGGGG CAGGGCGCGT CCGCCCGCGG CCGTGGCCCC
TCCGGGGTGC ACGTGGGGAT GACGAACACG CTGAACACCC CGATCGAGGC GCTCGAGCTG
GAGTACCCGA TGCGGGTGGA GCGCTACGAG CTGGCCGACG GCACGGGTGG GCCCGGCAGG
CATCCGGGCG GGGACGGCCT CGTGCGCTCG GTGCGGGTGC TGGAGCCGGC GACGCTGTCC
GTCCTGACCG ACCGGCGGCG GCACGCGCCC GGTGGGGTGG CGGGAGGTGG GCCCGGCGCG
GTCGGGCACA ACGATGTTGA CGGTGTTCCG CTCCCACCGA AGGCGAGCCG GCAGCTCCCT
GCCGGGTCGG TCGTCACCCT GCGCACACCG GGCGGTGGCG GATGGGGCCC CCCGCCGTAA
 
Protein sequence
MLRNLPLDRV VGGPVPARRR PAGCHAGGMD AVTLSVVSSA LGGIAEEMGT LLVRSAYSSN 
IKERRDCSAA IFDADARMIA QAAHVPVHLG AMYESVRVVA ERGPLPGDVW VLNDPFSGGN
HLPDVTLISP LAFDGEVVGY AATRAHHSDM GGMRPGSMPA DSTEIFAEGL IIPPVRLVRA
GRWEEDLLDL ICANSRTPDL RRGDLRAQAA ANGLAATRLA ELARRHGTGL VLESFAEVLR
YGERRSRAVI ARLPDGVHRV ESELEGDGVD DRDIALRVAV TIEGDGITVD FTGTSPAVRG
NVNCPRAVTR SACCFALRVL LPDDVPTGDG TYAPLTVVTE PGSLVDAQRP SAVVAGNVET
SQRIADTVLA ALRLAVGAGP DVLAAPGQGT MNNLVIGGAT WTYYETLGGG QGASARGRGP
SGVHVGMTNT LNTPIEALEL EYPMRVERYE LADGTGGPGR HPGGDGLVRS VRVLEPATLS
VLTDRRRHAP GGVAGGGPGA VGHNDVDGVP LPPKASRQLP AGSVVTLRTP GGGGWGPPP