Gene Franean1_1221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1221 
Symbol 
ID5669634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1461453 
End bp1463054 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content74% 
IMG OID641240153 
ProductRNA binding metal dependent phosphohydrolase 
Protein accessionYP_001505581 
Protein GI158313073 
COG category[R] General function prediction only 
COG ID[COG1418] Predicted HD superfamily hydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR03319] conserved hypothetical protein YmdA/YtgF 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.655673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.159074 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCGG ACCTTCGCCG GGAGGCGCGT GACGTTGAGG AGGAGGTCGA GCGGATCCGG 
CGCCGGGCCG AGCAGGACGC CGCGGAGCAG ACGGAGCGGG TACGGCGGGA GGCCGAGCAG
ATCCGCCGGC ACGCGGAGGA GGCCGCCGAG GCGATCCGGG AACGGGCGGT CGCGGACGCG
GAGCTGCGGG CGTCGAGGGC GGAGGCGGCC GCCCGCGACG CGATCCACGC GGAGCGCGAG
CAGATCCGCG CCGAGCTCGA CGAGGATCTG CGCACCCAGC GGACCGAGCT GCGCGGCTGG
GACAGCCGGC TCACCCAGCG CGAGCAGCGG GTCACCGACC AGGCTGCCAG CGTGGAGGAG
CGGCTGCGCC GGCTGGAGAC CCGCGAGGCC GAGCTGGCCG TCCGCGAGGC CGGGCTGGAC
AGCCGTGAGT CCGATCTCGG CGAGCTGGAG GAGGCCCGGC GGCGGGAGCT GGAGCGGGTG
GCGGGCCTGA CGTCCGCGGA GGCCCGCACC GAGCTGGTCA AGGTGGTCGA GGACCAGGCC
AGGCTGGACG CCGCCGTCCG GGTGCGTGAC ATCGAGGCCC GAGCCGAGGA GGAGGCCGAG
GACCGGGCGC GCCGGATCGT CACCCTGGCC ATCCAGCGGG TCGCCTCGGA CCAGACCGCC
GAGTCCGTGG TGTCGGTGCT GCACCTGCCC AGCGACGAGA TGAAGGGCCG CATCATCGGG
CGCGAGGGTC GCAACATCCG CGCGTTCGAG TCCGTGACCG GCGTCAACGT GCTCATCGAC
GACACCCCGG AGGCGGTGCT GCTGAGCTGC TTCGATCCCG TGCGCCGGGA GATGGGGCGG
ATCACGCTGA CGGCGCTGGT GTCCGACGGC CGCATCCACC CGCACCGGAT CGAGGAGGAG
TACGCCCGGG CCGAACGCGA GGTCGCGGCG AAGTGCGTCC GCGCCGGTGA GGACGCCCTG
ATCGACGTCG GCATCGCCGA GATGCATCCC GAACTGATCA ACCTGCTGGG CCGGCTGCGC
TACCGCACCA GCTACGGGCA GAACGTGCTC GCGCACCTCG TCGAGAGCGC CCACCTGGCC
GGGATCATGG CGGCCGAGCT GCGGCTGCCC CCGGCGATCG CGAAGCGCGG CACGCTCCTG
CACGACCTCG GCAAGGCGTT GACGCACGAG GTGGAGGGCT CCCACGCGAT CGTGGGTGCG
GAGATCGCCC GCCGCTACGG CGAGCACGAG GACGTCGTCC ACGCCATCGA GGCGCATCAC
AACGAGGTCG AGCCGCGCTC CATCGGGGCC GTGCTGACCC AGGCCGCGGA CCAGATCTCC
GGCGGGCGCC CCGGGGCACG CCGGGACAGC CTCGAGTCCT ACGTCAAGCG CCTCGAGCGC
ATCGAGCAGA TCGCCGCCGA GCGCCCCGGG GTCGAGAAGG TCTTCGCCAT GCAGGCCGGC
CGCGAGGTGC GGGTGATGGT CGTACCCGAG CTGGTCGACG ACGTGGCCGC CCACCTGCTC
GCCCGGGACG TCGCCAAGCA GATCGAGGAC GAGCTCACCT ACCCGGGCCA GATCCGGGTC
ACCGTGGTCC GCGAGACCCG CGCCGTCGGC ATGGCCCGCT AG
 
Protein sequence
MVADLRREAR DVEEEVERIR RRAEQDAAEQ TERVRREAEQ IRRHAEEAAE AIRERAVADA 
ELRASRAEAA ARDAIHAERE QIRAELDEDL RTQRTELRGW DSRLTQREQR VTDQAASVEE
RLRRLETREA ELAVREAGLD SRESDLGELE EARRRELERV AGLTSAEART ELVKVVEDQA
RLDAAVRVRD IEARAEEEAE DRARRIVTLA IQRVASDQTA ESVVSVLHLP SDEMKGRIIG
REGRNIRAFE SVTGVNVLID DTPEAVLLSC FDPVRREMGR ITLTALVSDG RIHPHRIEEE
YARAEREVAA KCVRAGEDAL IDVGIAEMHP ELINLLGRLR YRTSYGQNVL AHLVESAHLA
GIMAAELRLP PAIAKRGTLL HDLGKALTHE VEGSHAIVGA EIARRYGEHE DVVHAIEAHH
NEVEPRSIGA VLTQAADQIS GGRPGARRDS LESYVKRLER IEQIAAERPG VEKVFAMQAG
REVRVMVVPE LVDDVAAHLL ARDVAKQIED ELTYPGQIRV TVVRETRAVG MAR